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Foreword 


This  collection  of  technical  reports  deals  with  estimation 
of  spectra  of  stationary  processes,  both  by  the  now-standard 
direct  approach  and  by  the  more  recent  autoregressive  approach. 
The  topics  range  over  numerical  procedures  involving  Fast 
Fourier  Transforms,  cross  spectral  estimation,  minimum  bias 
windows,  vernier  FFTs,  coherence,  univariate  and  multivariate 
maximum  entropy  spectral  estimation,  and  probability  distri¬ 
butions  of  spectral  estimates.  These  results,  which  were  new 
when  puMshed,  are  stM  of  great  relevance  to  anyone  doing 
spectral  analysis  who  is  interested  in  obtaining  good 
resolution  and  stabWty  from  limited  record  lengths. 


Dr.  WMam  A.  Von  Winkle 
Associate  Technical  Director 
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Numerical  Evaluation  of 
Cumulative  Probability 
Distribution  Functions 
Directly  from 
Characteristic  Functions 


Albert  H.  Nuttall 

ABSTRACT 

A  method  for  direct  numerical  evaluation  of  the 
cumulative  probability  distribution  function  from 
the  characteristic  function  in  terms  of  a  single 
integral  is  presented.  No  moment  evaluations  or 
series  expansions  are  required.  Intermediate 
evaluation  of  the  probability  density  function  is 
circumvented.  The  method  takes  on  a  special  form 
when  the  random  variables  are  discrete. 
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NUMERICAL  EVALUATION  OF  CUMULATIVE  PROBABILITY 
DISTRIBUTION  FUNCTIONS  DIRECTLY  FROM 
CHARACTERISTIC  FUNCTIONS 


INTRODUCTION 

When  several  independent  random  variables  are  added,  the 
characteristic  function  of  die  sum  is  the  product  of  the  characteristic  functions 
of  the  individual  random  variables.  This  rule  holds  regardless  of  the  distri¬ 
butions  of  the  individual  random  variables,  and  whether  they  are  identically 
distributed  or  not.  Evaluation  of  the  cumulative  probability  distribution  of  the 
sum  variable  in  closed  form  is  often  very  tedious  or  impossible  to  achieve. 

This  is  especially  so  when  the  number  of  random  variables  added  is  large,  but 
not  large  enough  to  employ  the  Central  Limit  Theorem  with  accuracy. 

In  many  signal-detection  problems,  the  characteristic  function  of  the 
decision  variable  can  be  derived  in  closed  form  (or  evaluated  numerically 
fairly  easily).  Often,  however,  neither  the  probability  density  function  of  the 
decision  variable,  nor  its  integral,  the  cumulative  probability  distribution 
function,  can  be  obtained  in  closed  form.  Even  if  they  can,  they  are  frequently 
tedious  and  time-consuming  to  evaluate  (see,  for  example,  Marcum1).  In  this 
report,  we  present  a  technique  for  numerically  evaluating  cumulative  probability 
distribution  functions  directly  from  specified  characteristic  functions  in  terms 
of  a  single  integral.  Intermediate  evaluations  of  the  probability  density  func¬ 
tions  are  not  necessary,  and  no  moment  evaluations  or  series  expansions  are 
required.  The  technique  takes  on  a  special  form  when  the  decision  variable  is 
discrete. 

When  the  characteristic  function  of  the  decision  variable  (which  is  com¬ 
pared  with  a  threshold)  can  be  evaluated  for  both  the  signal-present  and  signal- 
absent  cases,  the  technique  can  be  applied  to  the  problem  of  obtaining  receiver 
operating  characteristics  (probability  of  detection  versus  probability  of  false 
alarm). 


ANALYSIS 

This  section  is  composed  of  two  subsections.  In  the  first,  a  general 
formula  for  direct  evaluation  of  the  cumulative  probability  distribution  function 
from  the  characteristic  function  is  derived;  in  the  second,  an  alternate  and 
more  useful  form  for  discrete  random  variables  is  presented. 
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GENERAL  DISTRIBUTIONS 


Let  random  variable  x  have  probability  density  function  (PDF)  p(x)  and 
characteristic  function  (CF)  f(f): 


and 


m  -  /■*  exp  (l*x)  p(x), 
P«  -  ^*/d|  exp  <-i$x)  m. 


(1) 

(2) 


(An  integral  without  limits  is  over  the  real  axis  from  -•  to  +« .) 


The  cumulative  distribution  function  (CDF)  Pr (X)  is  defined  as  the 
probability  that  random  variable  x  is  less  than  or  equal  to  X : 

X+ 

Pr®  -  f  dxp(x).  (3) 


The  upper  limit  means  that  an  impulse  in  PDF  p(x)  at  x  =  X  is  to  be  included 
in  fhll.  It  will  be  convenient  to  define  the  modified  distribution  function  (MDF): 


P® 


■L 


dx  p(x), 


(4) 


where  an  impulse  in  p(x)  at  x  =  X  is  only  half  included.  At  points  of 
continuity  of  the  CDF,  Pr®  and  P®  are  equal.  At  a  point  of  discontinuity 
of  the  CDF,  the  MDF  P®  takes  on  a  value  halfway  between  the  limit  values 
on  either  side  of  the  discontinuity. 2  The  CDF  Pr®  can  be  obtained  from  the 
MDF  P®  via 


Pr®  *  lim  P®  +  «). 
**■©+ 


(5) 


Therefore,  we  can  direct  our  effort  to  evaluating  either  the  CDF  Pr®  or  the 
MDF  P®,  depending  on  which  is  more  convenient. 
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When  Eq.  (2)  is  substituted  into  Eq.  (4),  we  not"  that  the  MDF 
becomes^ 


PCD  - 

-oo 


exp  (-i*x)  t  (|) 


dxexp(-ifx) 


^ «P (-lGpj 


T"  ^/Tf«)expH®* 


(6) 


where  the  last  integral  is  a  principal  value  integral.  Since  the  PDF  p(x)  is 
real,  the  real  part  of  the  CF  f(£)  is  even,  and  the  imaginary  part  of  the  CF 
f(£)  is  odd;  l.e.,  f(-{)  »  This  allows  Eq.  (6)  to  be  manipulated  into  the 

forms 


00 

P(X)  -  Y~  f  Y  Im{f(f)exp(-i£X)} 

0 

00 

=  y* [lm|f({)}cosCT- Re{f({)}sin(®j  .  (7) 

0 

Convergence  of  the  integrals *  at  the  origin  is  guaranteed  by  the  fact  that 


f  i*  integrable  »t  the  origin  if  P>-1.  No  moments  of  the  distribution 
•re  required  to  exist. 
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Im  {  f(0) }  *  0 


UmjtoJ!£  .  x 


Equation  (7)  is  the  general  equation  allowing  numerical  evaluation  of  the 
MDF  PCD  directly  from  the  CF  f(£).  For  a  discontinuous  CDF,  in  order  to 
minimize  inaccuracies  in  a  numerical  evaluation  of  Eq.  (7),  values  of  the  MDF 
P(D  points  removed  from  the  discontinuity  locations  (if  known)  should  be 
computed.  In  particular,  for  a  discrete  random  variable,  values  of  the  MDF 
at  points  midway  between  discontinuities  should  be  computed  when  using  | 

Eq.  (7). 

The  integral  in  Eq.  (7)  is  confined  to  the  real  axis.  Since 


p(x)  »  1  for  £  real. 


there  are  no  singular  points  along  the  £  axis.  Also  some  CF's  are  defined 
only  for  £  real;  for  example,  for 


p(x)  = 


r  2  * 

Hx 


f(£)  «  exp<-|£|),  real  £,  (12] 

but  f(£)  is  not  defined  for  complex  £.  Thus,  the  CF  f(£)  does  not  have  to  be 
analytic  at  the  origin  to  apply  Eq.  (7).  Nor  do  any  moments  of  the  random 
variable  have  to  exist. 


DISCRETE  DISTRIBUTIONS 

The  expression  (7)  applies  to  all  MDF's  (and  CDF's  through  Eq.  (5)); 
however,  it  requires  an  infinite  integral  for  each  value  of  X  Here  we  shall 
alleviate  this  requirement  for  a  special  class  of  random  variables.  Namely,  we 
consider  discrete  random  variables  that  can  only  take  on  values  which  are 
multiples  of  some  fundamental  Increment  A.  That  is,  the  PDF  of  interest 
takes  the  form 


cR  i(x-kA). 
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(A  sum  without  limits  is  over  the  integers  from  -»  to  +» . )  Then  the  CF  is 

f(0  =  23  c.  exp(ikAO,  (14) 

k  * 

which  is  periodic  with  period  2r/A.  Therefore,  the  coefficients  Jc^l  can  be 
determined  from  the  CF  f(£)  by 

°k  “  £  f  dSexp(-ikA*)f(*)  ,  (15) 

2r/A 

where  the  integral  is  over  any  interval  of  length  2*-/ A. 

Equation  (IS)  gives  the  area  of  any  impulse  in  the  PDF  p(x)  in  terms  of 
a  finite  integral  of  the  CF  f(£).  Since  we  are  interested  in  the  CDF  Pr(X),  a 
sum  over  |c^|  is  required.  At  this  point,  it  is  convenient  to  distinguish  tvo 
cases:  (1)  nonnegative  discrete  random  variables  and  (2)  general  discrete 
random  variables. 


NONNEGATIVE  DISCRETE  RANDOM  VARIABLES 

If  x  is  a  nonnegative  discrete  random  variable,  the  CDF  is,  at  integer 
value  M, 

M  f  M 

Pr(M)  ck  =  —  J  d*  f(*)£exp(-ikA$),  M>0,  (16) 

k*°  2#/A  k=0 


where  we  have  substituted  Eq.  (15).  Now 
M 

exp  (»ikA()  « 
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which  must  be  interpreted  as  M  +  l  at  |»0,  +2*/ A,  +4t/A,  ....  Using 
Eq.  (17)  and  the  fact  that  f(-{)  ■  f*({)»  we  cote  that  Eq.  (16)  becomes 

2  */A 


*  T  /  i««»*><-lMAt/2)}  .  M>0,  as) 

where  the  interval  (-  t/A,  */A)  has  been  selected  for  integration.  The  ratio 
of  sines  is  interpreted  as  M  +  l  at  the  origin  (  »  0.  Equation  (18)  is  a 
single  finite  integral  from  which  the  CDF  Pr(M)  can  be  evaluated  at  any  M 
directly  from  the  CF  f({). 

A  special  case  of  Eq.  (18)  is 

ff/A 

Pr(0)  *  c o"/  d$  Re  { f ({) }  .  (19) 

0 

(Actually,  c0  is  always  given  by  this  formula,  even  for  general  discrete 
random  variables,  as  may  be  seen  from  the  general  formula  (Eq.  (15)).) 

The  case  of  a  discrete  random  variable  taking  on  values  in  a  semi¬ 
infinite  range  (i.  e. ,  (-«,N)  or  (N,«),  where  N  is  finite  but  can  be  positive 
or  negative)  can  be  handled  in  a  similar  fashion.  The  key  is  that  a  finite  sum  of 
exponentials  (like  Eq.  (17))  can  be  evaluated  without  requiring  a  summation. 


GENERAL  DISCRETE  RANDOM  VARIABLES 

Here  we  shall  consider  discrete  random  variables  which  can  take  on 
values  in  the  range  (  <*).  From  Eqs.  (7),  (4),  and  (13), 

00 

P<»)-y-V (20) 

o 


-1 


•E 


ck  +Jco 


(21) 
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That  is,  the  value  of  the  MDF  P(X)  at  the  origin  can  be  evaluated  by  a  single 
infinite  integral.  There  does  not  seem  to  be  any  simpler  way  of  obtaining  this 
number,  which  will  be  necessary  in  the  development  to  follow.  In  some  cases, 
it  may  be  possible  to  evaluate  the  particular  value  P(0)  from  the  integral 
Eq.  (20)  in  closed  form,  or  expand  it  in  a  rapidly  convergent  series,  while 
PCS)  could  not  be  so  evaluated  generally  forZ^  0.  In  any  event,  Eq.  (20)  will 
be  the  only  Infinite  integral  necessary  to  evaluate  in  order  to  get  the  complete 
CDF  for  this  general  discrete  case. 

The  area  of  the  impulse  at  the  origin  is  given  by  Eq.  (15)  as 

*/A 

co-«7  f  d*Re{f(*)}.  (22) 

0 


Now  let  us  define  auxiliary  functions 


v-£ 


c.  ,  M> 0  , 


0 

s.(M>“J[^v  m-°-  <*4> 

By  a  development  similar  to  Eqs.  (16)  through  (18),  we  find  that  these  auxiliary 
functions  can  be  expressed  directly  in  terms  of  the  CF  f({)  as 

Re)f(0exp(TiMA«/2)},  MJ>0,  (25) 

where  the  ratio  of  sines  is  interpreted  as  M  +  1  at  the  origin  $  *  0. 

The  CDF  Pr(M)  then  can  be  evaluated  at  any  M  according  to 

p(°)-)c  +S+(M),  M>0 
Pr(M) »  * 

p(0)+|c  -3  (|M+1|),  M<0  .  (26) 
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Here  P(0)  Is  given  by  Eq.  (20),  c0  by  Eq.  (22),  and  S+\(M)  by  Eq.  (25).  The 
constants  P(0)  and  cQ  need  be  evaluated  once,  but  Eq.""(25)  must  be  evaluated 
for  each  M  of  interest.  However,  Eq.  (25)  is  a  finite  integral. 


EXAMPLES 

We  shall  consider  two  examples  recently  examined  by  Heist rom*  for 
purposes  of  comparison. 

Example  1  -  Exponential  Distribution 


(27) 

x>°  1 

X<oJ,  (28) 

HO  *  <l-i«f 1  .  (29) 

The  exact  CDF  is  given  in  Eq.  (28).  Approximate  values  for  the  CDF  are 
obtained  by  substituting  Eq.  (29)  into  Eq.  (7)  and  approximating  the  infinite 
integral  by  a  finite  sum.  Results  are  indicated  in  Table  1. 

The  integral  of  Eq.  (7)  was  sampled  in  (  at  values  indicated  by  column 
four  of  Table  1  and  approximated  by  the  trapezoidal  rule  for  Integration.  The 
limit  of  integration  in  Eq.  (7)  was  taken  to  be  the  value  above  fiO  where  the 
finite  sum  deviated  most  from  the  exact  answer.  Thus,  the  finite  sum  in 
column  three  of  Table  1  is  the  worst  approximation  to  the  exact  answer  in 
column  two. 

For  this  example,  the  largest  error  occurred  at  the  origin.  This 
happened  because  the  integrand  of  Eq.  (7)  oscillates  for  0,  thereby  con¬ 
verging  fairly  rapidly,  whereas  the  integrand  decreases  monotonically  only  as 
(1+*2)-1  forX*  0. 


P(x) 


-{ 


Pr(X)  -  P(X) 


exp(-x),  x>0  j 
0,  x  <  0  /  , 

1  -  exp  (-X), 
0, 


■r 
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Table  1 


NUMERICAL  COMPUTATION  OF  EXPONENTIAL  DISTRIBUTION 


Pr(U 


Finite  Sum  Increment  Approximate  Limit 
via  Eq.  (7)  in  i  of  Integration 


.00001 


I'M1!1/ 


.18127 


.63212 


.86466 


.9999546 


.00008 


.00532 


.18096 


.63220 


.86470 


. 9999637 


Example  2  -  Poisson  Distribution 


OO 

p(x)  -  exp(-X)  ]P  —  6(x-k), 

fco  k! 


Pr(M)  = 


exp(-  x)  ^  ^7  *  M  >  0 


0,  M  <  0 


f({)  -  exp[x{exp(i{)-l|]  .  (32) 

The  exact  CDF  is  given  in  Eq.  (31).  Approximate  values  for  the  CDF  are 
obtained  by  substituting  Eq.  (32)  into  Eq.  (18),  with  <3-1,  and  approximating 
the  finite  integral  by  a  finite  sum.  Results  are  indicated  in  Table  2. 


Tablet 

NUMERICAL  COMPUTATION  OF  POISSON  DISTRIBUTION 


Pr(M) 


.46545  37085 


.66412  32006 


.51701  50655 


.55566  15601 
.55560  26867 


.55555  55765 


Finite  Sam  via  Eq.  (18) 


.00000  03065 
.00000  46545 
.00763  18558 


.46565  37065 


.66412  32004 


.51708  50656 
.55666  15500 
.55560  26865 
.55555  65764 


Tba  Integral  of  Eq.  (18)  m  divided  into  25  aqual  interval*  and  approxi¬ 
mated  by  tba  trapoaoidal  rul«  for  integration.  Columns  two  and  thraa  of  Table  2 
•bow  that  tba  arror  in  tba  approximation  occurs  in  tba  tenth  placs  (and  may  bs 
duo  to  computer  inaccuracies  rather  than  sampling  errors).  Also,  tba 
accuracy  bolda  on  the  tails  of  tba  CDF  aa  wall  aa  near  tba  mean. 


CONCLUSIONS 

Tba  anmarical  technique  suggested  for  obtaining  CDF's  directly  from 
CF*s  baa  conaidarabla  merit.  It  requires  no  moment  evaluations  or  series 
scansions  (like  Edgeworth  or  Laguerre)  for  tba  distributions.  It  does  not  de¬ 
pend  upon  evaluation  of  derivatives  of  CP's,  but  depends  only  upoo  the  values  of 
the  CF  itself.  (Evaluation  of  higb-order  derivatives  can  be  extremely  tedious 
and  time- consuming  even  if  an  analytic  form  (Or  the  CF  is  available. )  The 
accuracy  of  the  suggested  technique  can  be  estimated  and  controlled  by  de¬ 
creasing  the  increment  in  the  Integral  evaluations  or  lengthening  the  interval  of 
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hrtnattoa  or  both;  tho  ohaafi  la  tho  iffwdartha  la  a  moony  of  tho  orror 
at  that  pot  at.  Tho  mothod  dooo  not  roqulro  aa  taordlnato  anmbor  of  oamptoo  of 
tho  CF,  at  laaat  far  tho  oanunpioo  oonatdarod,  and  tho  addttttooal  ftmottaoo 
roqolrlaf  oratnattoa  aro  olaoo  and  oootnao.  btoraaodlato  oraluattoB  of  tho 
PDF  la  oatlraly  dr  onmroatod.  (01  oovroa,  optima  too  of  tha  PDF  aro  avail- 
ahlo  ao  dlfforoaoaa  of  tho  CDF,  if  doairod. ) 
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Alternate  Forms  and 
Computational 
Considerations  for 
Numerical  Evaluation  of 
Cumulative  Probability 
Distributions  Directly 
from  Characteristic 
Functions 


Albert  H.  Nuttall 

ABSTRACT 

Alternate  Integral  forms  for  the  cumulative  prob¬ 
ability  distribution  in  terms  of  the  character¬ 
istic  function  are  given.  In  particular,  forms 
that  can  utilize  a  fast  Fourier  transform  (FFT) 
algorithm  and  special  forms  for  one-sided  proba¬ 
bility  density  functions  are  derived.  For  a 
special  class  of  discrete  random  variables,  all 
Integral  evaluations  are  over  a  finite  rarge. 

Some  computational  aspects  of  utilizing  tie  FFT 
are  discussed. 
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REVERSE  BLANK 


ALTERNATE  FORMS  AND  COMPUTATIONAL  CONSIDERATIONS  FOR 
NUMERICAL  EVALUATION  OF  CUMULATIVE  PROBABILITY  DISTRIBUTIONS 
DIRECTLY  FROM  CHARACTERISTIC  FUNCTIONS 


1.  INTRODUCTION 

A  recent  report  [l]  on  numerical  evaluation  oi  cumulative  probability  dis¬ 
tribution  functions  directly  from  characteristic  functions  (OF)  gave  the  cumula¬ 
tive  distribution  functions  (CDF)  in  terms  of  a  single  integral  on  the  CF  for  both 
continuous  and  discrete  random  variables  (RV) .  In  this  report  some  alternate 
forms  for  the  CDF  in  terms  of  the  CF  will  be  presented,  with  an  aim  toward 
more  accurate,  efficient,  and  expeditious  calculations.  For  the  motivation  of 
this  study  and  utility  of  the  results,  as  well  as  numerical  examples,  see  Ref¬ 
erence  1. 


2.  ANALYSIS 

This  section  is  composed  of  five  subsections.  In  the  first,  general  dis¬ 
tributions  are  considered;  in  the  second,  specialization  to  a  nonnegative  random 
variable  is  made.  In  both  subsections,  forms  that  utilize  a  fast  Fourier  trans¬ 
form  (FFT)  are  derived  and  their  applicability  is  discussed.  In  the  third  and 
fourth  subsections,  discrete  random  variables  are  considered.  The  former 
subsection  shows  that  the  distribution  function  can  be  evaluated  entirely  in 
terms  of  finite  integrals;  the  latter  subsection  specializes  to  nonnegative  dis¬ 
crete  random  variables.  The  fifth  subsection  treats  some  computational 
aspects  of  the  FFT . 


2.1  GENERAL  DISTRIBUTIONS 

Let  RV  x  have  probability  density  function  (PDF)  p(x)  and  CF  f(£): 

Hi)  =  J dx  exp(ifx)  p(x)  ,  (1) 

P(x)  "  27  /«  exPH$x>  Hi)  •  (2) 
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(Integrals  without  limits  are  over  the  real  axis  from  -«  to  «  .)  The 
CDF  Pr(X)  is  defined  as  the  probability  that  RV  x  is  less  than  or  equal  to  X. 
The  modified  distribution  function  (MDF)  P(X)  is  defined  equal  to  Pr(X)  at 
points  of  continuity,  but  it  takes  a  value  midway  between  limit  values  on  either 
side  of  a  discontinuity. 

The  MDF  P(X)  can  be  obtained  from  the  CF  by  [l,  Eq.  (7),  or  2,  Eq. 
(4.14)] 


P  (X) 


Im  jf($)  exp(-i£X)  |, 


all  X  . 


(3) 


If  we  attempt  to  remove  the  Imaginary  operation  from  under  the  integral  sign, 
we  obtain  an  infinite  integral  since  f(0)  =  1.  However,  if  we  express 

m  -  [f«)  -  a(*)]  +  a($)  ,  (4) 


where  a(0)  =  if  and  split  (3)  into  two  integrals,  we  can  move  the  Imaginary 
operation  out  of  the  first  integral  in  (3).  One  particularly  useful  choice  for 
a({) ,  which  results  in  a  closed  form  expression  for  the  second  integral  in  (3),  is 

ax(£)  -  exp[W  "  f  *  *2]»  *  >  0  ,  (5> 

where*  n  and  <r2  are  the  mean  and  variance  of  RV  x.  The  mean  and  vari¬ 
ance  are  available  from  f(0)  and  f'(0),  if  these  quantities  can  be  evaluated;  if 
not,  the  method  to  be  described  is  still  applicable  with  arbitrary  constants  used 
for  m  and  <r2.  When  (4)  and  (5)  are  substituted  into  (3),  there  results  [3,  Eq. 
3.896  4;  integrate  both  sides  with  respect  to  b] 


P(X)  »  +1 


m  -  a1(() 


}• 


exp(-i*X)>,  all  X 


(6) 


'Actually,  M  and  could  be  assigned  arbitrary  value*  in  the  form  (5);  this 
particular  choice  give*  a  *econd-order  fit  to  f({)  at  the  origin. 
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where 


*(y) 


=  /*dt(2r)"1//2  exp(-t2/2) 
-00 


(7) 


is  the  Gaussian  CDF. 

Equation  (6)  is  now  in  a  form  where  an  FFT  can  be  utilized  in  the  integra¬ 
tion  on  (  (See  Subsection  2.5).  This  equation  is  exact;  we  are  not  making  a 
Gaussian  approximation  in  (6).  There  is  no  problem  in  the  integration  at 
because,  for  the  choice  of  n  and  <r2  as  the  mean  and  variance  of  RV  x, 


m  -  a1({) 
I 


0(*2) 


as  {  —  0+  . 


(8) 


Also,  since  |  a^)]  decays  as  exp(-*2{2/ 2),  the  decay  of  the  left  side  of  (8) 
for  large  (  will  often  depend  on  the  decay  of  f(£)/£  ;  this  decay  will  dictate  how 
far  the  integral  in  (6)  must  be  carried  out  for  specified  accuracy  in  P(X). 

Other  choices  for  a(£)  are  possible  and  sometimes  recommended.  For 
example,  if  the  mean  and  variance  of  RV  x  do  not  exist  (e.g. ,  p(x)  =  *•“! . 

(1  +x2)-l,  all  x),  we  might  choose 


a2({)  -  exp(-b()>  *  >  0  .  (9) 

To  best  match  f({)  near  the  origin,  we  could  choose 


b  =  -f'(0+)  =  |  f *  (0+)  |  (assuming  f’(0+)  real)  . 
Then  by  substituting  (4)  and  (9)  into  (3)  [3,  Eq.  3.941  l],  we  get 


(10) 


P(X)  =  ~  ~  arctan(X/b) 

Lt  If 


-ilm  j/d{ 


m  -  a2(f) 


exp(-ifX) 


}• 


all  X  . 
(ID 
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For  the  choice  of  b  in  (10), 


m  -  a  ($) 

- |— = — -  0(()  as  *  -  0+  ,  (12) 

so  no  problem  in  integration  arises  at  the  origin.  We  must  be  able  to  evaluate 
F(0+)  in  this  case  so  that  b  is  known,  and  it  must  be  real.  In  cases  where 
f  (0+)  is  not  known  or  is  infinite  (See  Appendix  A,  for  example),  the  above 
methods  are  inapplicable,  and  special  techniques  such  as  subtracting  out  the 
singularity  are  required. 


2.2  NONNEGATIVE  DISTRIBUTIONS 

When  RV  x  is  limited  to  nonnegative  values,  some  simplifications  in  the 
general  form  (3)  occur.  (The  case  of  nonpositive  RV  x  can  be  treated  in  a 
similar  fashion.)  First,  if  X  <  0  in  (3),  then  P(X)  *  0.  Letting  X  *  -a  yields 

00 

\  -  7  Jj  [fr«)  sin(a{)  +  f^)  cos  (a?)],  a  >  0  ,  (13) 


where  subscripts  r  and  i  denote  real  and  imaginary  parts,  respectively. 
Employing  (13)  in  (3)  for  X  >  0,  we  get 

00 

P(X)  =  | /f  tr(0  sin($X),  X  >  0  ,  (14) 

o 


or 


P(X)  =  1  -  |  /**  fjtf)  cos(£X),  X  >  0  .  (15) 

o 

Thus,  the  MDF  P(X)  can  be  evaluated  from  knowledge  of  either  the  real  part 
or  the  imaginary  part  of  the  CF  f(J).  For  X  ■  0,  neither  (14)  nor  (15)  is 
necessarily  valid,  and  we  must  resort  to  (3) . 
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There  are  computational  reasons  for  choosing  (14)  over  (15),  or  vice 
versa.  The  first  has  to  do  with  ease  of  calculating  fr(f)  versus  f}({).  For 
example,  in  Appendix  A,  for  p(x)  =  2/r  (1  +  x2)”1  for  x  >  0,  we  find  that 
fr(£)  is  a  simple  exponential ,  whereas  f}(£)  is  a  sum  of  exponential  integrals. 
Converse  examples,  where  fj({)  is  simpler  to  compute,  can  also  be  found. 

i 

The  second  reason  has  to  do  with  the  rate  of  decay  of  fr(f)  versus  f^(f). 

We  have 


00 

tTw  =  /  dx  p(x)  cos(fx)  p^(x)  cos  (f  x) ./dx  Pe(x)  exp  (if  X) 


(16) 


ft«) 


dx  PQ(x)  sin(fx)  =  i 


dx  pQ(x)  exp  (if  x)  , 


(17) 


where  subscripts  e  and  o  denote  even  and  odd  parts,  respectively.  Now,  if 
p(0+)  >  0,  then  pQ(x)  is  discontinuous  at  the  origin,  and  f^(f)  decays  only  as 
f  ”1  for  large  f  .  An  example  is 

p(x)  =*  e”X,  x  >  0;  f(f)  .  (1  -  if)-1  , 

fr«>  -  (i + s2y\  ft«)  -  *(i + f2)"1 .  as) 

In  (18),  fr(f)  decays  as  f  ”2  for  large  f  ,  giving  rise  to  an  integral  in  (14) 
that  can  be  terminated  earlier  than  the  one  in  (15).  On  the  other  hand,  consider 
that  p(0+)  =  0  and  that  p(x)  and  its  derivative  are  continuous  except  at  the 
origin,  but  p'(0+)  >  0.  Then  p0(x)  and  its  derivative  are  continuous,  whereas 
Pg(x)  is  discontinuous.  In  this  case,  fr(f)  decays  only  as  f”2  for  large  f. 
An  example  is 


p(x)  -  xe”X,  x  >  0;  f(f)  »  (1  -  if)"  , 

ff(f)  -  (l  -  t2)  (l  +  fjtt)  -  2f(l  +  {2)“2  .  (19) 
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Here  (15)  could  be  terminated  earlier  than  (14). 


The  third  reason  has  to  do  with  the  region  of  X  of  interest.  For  large  X, 
where  P (X)  is  near  unity,  Eq.  (15),  in  the  form 


1  -  P(X) 


oo 

=  1  fdt 


(?)  cos(?X),  X  >  0 


(20) 


is  to  be  recommended,  since  it  is  an  alternating  sum  of  small  quantities  and  re¬ 
tains  significance.  Equation  (14),  for  large  X,  is  an  alternating  sum  of  large 
quantities  and  loses  significance.  But  for  small  X,  Eq.  (14)  would  be  recom¬ 
mended. 

Equation  (15)  can  be  immediately  manipulated  into  a  form  where  an  FFT 
can  be  utilized.  Namely, 


P(X)  =  1  -  7  Re 


(  r  fi(*>  ) 

Jjdiy-  exp(-i?X)j, 


X  >  0  . 


(21) 


From  Appendix  A,  we  have  fi(?)/?  —  as  £  —  0,  if  mx  exists  and  is  finite. 
If  we  attempt  to  express  (14)  in  the  form 


Im 


fr($)  exp(i?X)>  , 


we  obtain  an  integral  that  does  not  converge  at  the  origin.  However,  if  we  ex¬ 
press 


fr<*>  =  [fr«)  “  b<?)]  +  b(?)  ,  (22) 

where  b(0)  =  1  and  b(?)  is  real,  then  (14)  becomes 
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P(X)  =  7  Ln 


I /•* 


t(&  ~  b«) 


exp(i£X) 


}•»/■ 


b«)  ,  X  >  0  , 


and  an  FFT  can  be  used  on  the  first  integral.  Preferably,  the  second  integral 
should  be  integrable  in  closed  form.  A  particularly  useful  choice  is 


b(£)  =  exp 


(-}'/)• 


$>  0  , 


where*  is  the  mean-square  value  of  RV  x.  This  quantity  is  available  from 
fji(0)  if  it  can  be  evaluated.  When  (24)  is  substituted  into  (23),  we  get  (See  (3) 
through  (7)) 


P(X)  =  2*^-^-  1  +  |lm  jy* d{ 


*  fr<*>  -  expH  V' 


exp(i{X)>,  X  >  0  . 
*  (25) 


The  function 


fr«)  -  exp(-  \ 


-0  as  (  -  0+ 


in  (25)  if  the  mean-square  value  m2  exists.  In  many  cases,  it  decays  as 
fr(£)/£  for  large 

The  fact  that  MDF  P(X)  can  be  obtained  from  either  the  real  or  imaginary 
parts  of  the  CF  for  a  nonnegative  distribution  are  manifestations  of  the  fact  that 
fr(£>  and  fj(£)  can  be  found  from  each  other;  in  fact,  they  are  related  by  Hil¬ 
bert  transforms.  For  p(x)  =  0  for  x  <  0,  and  no  impulses  at  the  origin,  we 
see  that  [4,  p.  38] 


*As  in  the  footnote  to  Eq.  (5),  M2  could  be  assigned  any  convenient  value. 
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Hi)  53  J dx  p<x)  exp(i£x)  -  f  P(x)  U(x)  exp(I$x)  »  3{p(x)  U(x)| 


=  3{p(x)}  »  3{U(x)}  =  f«)  •  «<*)  +  “  J  Cf«)  +  . 

(26) 

where  U  (x)  is  the  unit  step  function,  3  denotes  a  Fourier  transform,  • 
denotes  convolution,  and  X  denotes  a  Hilbert  transform.  Therefore, 

m  -  i*  {m\  ,  (27) 

or 

«i«)  “  fr«)  »  -X  fyal  •  (28) 

For  the  cases  when  p(x)  contains  an  impulse  at  the  origin  of  area  cQ,  the 
first  part  of  (28)  is  still  correct,  but  the  second  part  is  incorrect  by  the  addi¬ 
tive  constant  cQ.  However,  we  can  still  find  fr({)  from  fi(£)  by  utilizing  the 
fact  that  fr(0)  =  1.  Thus,  either  the  real  or  the  imaginary  part  of  the  CF  con¬ 
stitutes  complete  knowledge  about  the  MDF  in  the  case  of  a  nonnegative  dis¬ 
tribution. 


2.3  DISCRETE  DISTRIBUTIONS 

In  this  subsection,  the  RV  x  is  restricted  to  take  on  values  that  are  mul¬ 
tiples  of  some  fundamental  increment  A ,  and  can  be  either  positive  or  nega¬ 
tive.  Although  the  equations  in  Subsection  2.2  are  applicable  here,  it  is  advan¬ 
tageous  to  have  forms  for  the  distribution  function  that  require  finite  integrals 
rather  than  infinite  ones .  We  have  for  the  PDF 

P(x)  "  kA>  *  (2®) 

k 
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where  the  sum  range*  from  -«  to  »  .  The  CDF  Pr(M)  for  integer  M  i* 
given  in  Reference  1,  Eq*.  (20)  through  (26).  All  the  integral*  are  finite  inte¬ 
gral*  except  for  the  one  in  (2C)  for  MDF  value  P(0); 


1  i/LVS 

2  *  J  **  l  * 


We  now  rectify  this  situation  and  obtain  a  finite  integral  for  P(0)  also. 
From  Reference  1,  Eq.  (15), 


Ck  "  2r  I  d*  f({)  exP<”ikA*)  • 


Therefore,  by  using  Appendix  B  and  f(-  £)  -  f*(£).  we  get 


k— * 


k  2w 


k»-oo 


exp(ikA£) 


r 

t.  Jwl“ '«» I* 


-|°o  +  ‘«>c"(¥) 


1  i  a  f‘(f) 

2  co  +  2  "  2 w  J  d£  tan(A£/2) 


Then,  we  obtain  the  desired  result 
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(32) 


-1 

p<°>  ■ 


c.  +  *  c 
k  2  o 


1  -  A  f  df 

2  2  w  J  ' 


ftU) 

tan(Af/2) 


|(  s  •  oc 


As  f  ~  0+,  the  integrand  of  (32)  approaches  2m,/!  if  mx  exists  and  Is 
finite.  (There  is  no  integral  expression  for  P(0)  in  terms  of  fr(f ).  Since  fr(f) 


is  the  Fourier  transform  of 


Pe(*> 


(See  Eq.  (16)),  and  since 


c 

/ 


dx  Pe(X)  -  J  , 


irrespective  of  the  form  of  p(x),  fr(f)  contains  no  information  about  P(0). 
This  is  analogous  to  the  general  distribution  case  where  P(0)  follows  from 
(3)  as 

P(0)  -  \  -  7  fdk  f^fj/f  .  ) 
o 


2.4  NONNEGATIVE  DBCRETE  DISTRIBUTIONS 

When  RV  x  is  limited  to  nonnegative  values,  the  CDF  Pr(M)  takes  on 
forms  requiring  either  the  real  or  imaginary  parts  of  the  CF  for  its  evaluation, 
just  as  in  Subsection  2.2.  To  see  this,  we  note  that  in  (29)  is  zero  for 
k  <  0.  By  letting  k  »  -m  in  (30),  we  get 


df  sin(mAf)  f  (f) 


o 


/A 

df  cos(mAf)  ff({)  for  m  >  0  . 


(33) 


When  we  employ  (33)  into  (30)  for  k  >  0  ,  we  get 


c 


k 


df  coe(kAf)  fr(f), 


k  >  0  , 


(34) 
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-“/  * 


sin(kA*)  ft(f),  k  >  0 


Therefore,  the  CDF  Pr(M)  for  integer  M  is  given  by 


M  WA  M 

Pr(M)  ck  -  TT  /"  df  *rU>[j  cos<kA°] 

k«0  0  k«l 

■  T  /  d*  fr<*>  - ^  FI  %  ^  ,  M  >  0  , 

5  sin  I  2  U\ 


where  we  have  used  (34),  (30),  the  fact  that  f(-{)  -  P({),  and  Eq.  1.342  2  in 
Reference  3.  Equation  (36)  enables  evaluating  the  CDF  in  terms  of  the  real 
part  of  the  CF  alone. 

To  represent  Pr(M)  in  terms  of  f^(f),  we  first  note  that  for  nannegative 
RV,  the  general  formula  for  P(0)  in  (32)  becomes 

■  <3’ 


Now, 


Pr(M) 


c.  ■  c 
k  o 


/»■ A  M 

df  f.({)  ain(kA() 
k-1 
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where  we  have  employed  (35),  (37),  and  Eq.  1.342  l  in  Reference  3.  Equation 
(38)  la  complementary  to  (36)  in  the  eenae  that  only  the  Imaginary  pan  of  the 
CF  ia  neceeaary  for  evaluating  the  CDF.  The  reaaoaa  given  in  Subaection  2.2 
far  aelecting  (36)  or  (38)  in  a  particular  application  are  again  relevant. 


2.5  USE  OF  FFT  FOR  FOURIER  TRANSFORMS 

Many  o t  the  integrate  la  thia  report  taka  the  form 


g(t)  exp(-L2rft)  . 


Suppose  a  limit  T  on  the  integration  can  be  found  auch  that 


g(t)  exp ( -12 r ft) 


<  •  for  all  f  , 


(39) 


where  •  la  aome  apeclfled  tolerance  or  error.  Then,  attention  can  be  focuaed 
on  evaluating 


GTff)  • 


g(t)  exp(-i2rft)  . 


(40) 


Since  the  Integration  In  (40)  la  over  an  Interval  of  length  T,  it  ia  aeen  that 
G-r(f)  will  undergo  a  aignlf leant  change  in  value  in  an  interval  no  smaller  than 
1/T  in  f.  Thus,  one  might  initially  anticipate  that  (40)  should  be  evaluated  at 

values  of  f  ■  n/T,  n  ■  1,  2 .  However,  in  many  cases,  thia  resolution, 

l/T ,  may  be  much  too  fine,  and  be  the  result  of  satisfying  (39)  with  a  very 
small  «  .  In  such  cases,  values  of  G-p(f)  at  aome  multiple  of  the  fundamental 
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resolution  may  be  satisfactory,  uy  m/T,  where  m  ia  an  integer.  Thus,  we 
might  be  interested  only  in  evaluating  G^n™  j  ,  n  =  1,  2,  . . . .  But  from  (40) 


•(“?)  -/* 


g(t)  «txp(-L2nunt/T) 


m- 1  (k+1  )T/m 

y]  /  dt  g(t)  exp(-i2rnmt,  T) . 
k-0  kT/m 


In  making  the  substitution  u  ■  t  -  kT/m  in  (41)  and  defining  the  collapsed 
function 


«c<“)  “  k-0 


we  note  that  (41)  becomes 


^  |  g  ^u  ♦  k  0  $  u  <  T/m 


,  otherwise 


(“  t)  ■  /  du  *cm  “'“(•“'ffe  “) 


The  collapsed  function  gc  is  obtained  from  g  by  "pre-aliasing"  g  into  the 
interval  T/m.  If  we  define  the  Fourier  transform  of  gc  as 


>m"f 


du  g  (u)  exp(-t2rfu)  , 
c 


then  (43)  yields 
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4  ?)•«.(■?) 


That  is,  C*j.(f)  can  be  evaluated  at  f  ■  m/T,  2m/T , . . . ,  in  terms  of  the 
Fourier  transform  of  the  coll  speed  function. 

Now  suppose  ^(u)  is  sampled  at  increments  of  h  in  (44),  where 
h  =  ~T7~  .  and  weighting  ]  w^|  applied  to  the  samples  in  an  effort  to  approxi- 
mate*  (44).  That  is. 


G  (1)1  h 
c 


g  (kh)  exp(-i2rfkh)  ■  G  (f)  . 
JC  c  c 


The  approximation  in  (46)  will  be  good  if  gc  and  the  exponential  are  sam¬ 
pled  frequently  enough.  Thus,  if  the  exponential  is  not  to  vary  by  more  than  a 
radian  between  samples,  we  require 


If  I  < 


2*h  * 


When  (45)  through  (47)  are  combined,  the  desired  values  are  given  by 


GT(n  9  Gc(n  t")  "  Gc(n  t)  “  h  £  wk  *0^  exp(-i2»kn/M)  (48) 


,  .  m  I  .  ,  ,  1  M 

11  T"  <  2»h  ’  °r  ln!  r  2 


By  defining 


For  eisapl*.  'si«»f>*on'e  rul*  Kn*  *  1  3.  s  4  3  •  -> n ^  s  *  3  =  *  3 
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"k 


hw,  g  (kh),  1  <  k  <  M  -  1 

k  c 

hw  g  (0)  +  hw  g  (T/m),  k  ■  0 

O  C  MC 


(50) 


we  can  express  (48)  as 


M-l 

CT(n  T  )  *  Gc(n  f)  s  Gc(°  \  exp(-i2rta/M)  .  (51) 

which  is  an  M -point  discrete  Fourier  transform  PFT)  of  the  sequence  jd^J. 
The  factor  l/»  in  the  upper  bound  on  |  n|  in  (49)  is  due  to  the  aliasing  in  the 
frequency  domain  that  takes  place  at  |nf  ■  M/2.  In  fact,  letting  jw^}  be  the 
samples  of  waveform  w(t)  at  t  *  kh  and  W(f)  its  Fourier  transform,  it  can 
be  shown  that  (Appendix  C) 

Gc(f  “  kM  y)  •  (52) 

a  /  M  m  \ 

Thus,  the  value  Gc\  ■y  tj?  )  is  composed  of  at  least  two  overlapping  tails  of 
Gc(f).  In  order  to  avoid  this  aliasing,  we  must  observe  (49). 

To  summarize,  the  values  of  Gp(f)  at  f  =  n-y  are  given  approximately 
as  an  M-point  OFT  in  (51)  of  the  sequence  jd^[  in  (50).  When  (42)  is  substi¬ 
tuted  into  (50),  this  sequence  can  be  expressed  as 


Gc(f)  -  W  (f)  • 


“k 


(53) 


As  is  obvious  from  (53),  g(t)  must  still  be  evaluated  from  0  to  T  in  incre¬ 
ments  of  h,  that  is,  at  mM  +  1  values.  However,  collapsing  reduces  the 
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size  of  the  FFT  from  mM  to  M,  with  an  attendant  reduction  in  computation 
time  and  round-off  error.  This  method  is  related  to  one  given  in  Refer¬ 
ence  5,  p.  81. 

In  appfying  this  technique  to  numerical  integration  of  CF's,  since  the  ex¬ 
ponentials  take  the  form  exp(+i£X),  we  note  that  the  increment  in  X  at  which 
values  are  obtained  by  employing  an  FFT  are  2r/T,  or  2rm/T  for  coarser 
resolution  as  above. 


3.  CONCLUSIONS 

Several  alternate  forms  for  direct  numerical  evaluation  of  the  CDF  or 
MDF  from  the  CF  have  been  presented  that  have  utility  in  different  situations, 
including  ease  of  calculation,  rate  of  decay  of  the  integrands,  and  the  probabil¬ 
ity  region  of  interest.  Also,  the  speed  of  the  FFT  and  the  large  number  of 
values  of  the  distribution  functions  that  are  quickly  available  make  the  formulas 
presented  attractive  in  a  large  number  of  practical  applications . 

In  the  case  of  discrete  distributions  with  RV  that  can  Lake  on  positive  as 
well  as  negative  values,  all  integrals  for  the  CDF  are  finite  and  over  a  half¬ 
period  of  the  CF.  Reevaluation  of  the  sines  or  cosines,  as  in  (36)  or  (38)  for 
different  values  of  M,  can  be  avoided  if  one  notes  that 


cos[a]  +  cos 


with  a  similar  result  for  cosine.  Thus,  if  a  table  of  sin(a)  and  cos(a)  for  the 
values  of  a  (A{)  is  constructed,  this  recurrence  relation  can  be  used  to  obtain 
the  higher  order  M-dependence  required  in  (36)  and  (38)  without  reevaluating 
sines  and  cosines. 
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Appendix  A 


BEHAVIOR  OF  INTEGRAND  OF  EQ.  (3)  AT  ORIGIN 


The  integrand  of  (3)  is  given  by 


f  (*)  costfX)  f  «)  sin(£X) 
7  Im  )  ftf)  exp(-UX)}  =  — 


* 


where  subscripts  r  and  i  denote  real  and  imaginary  parts,  respectively. 
Now, 


fr«)  sin(*X) 


X  as  f  -  0+ 


And 


f.($)  cos(£X) 


=  y* dx  SiP(— -  p(x)  cos ({X) 


~J" dx  x  p(x)  s  ^  as  {  —  0+  if  ^  exists  and  is  finite. 

Here  is  the  mean  of  RV  x.  Therefore,  the  integrand  of  (3)  approaches 

m  -X  as  0+  if  m  exists  and  is  finite, 

x  x 

An  example  where  mx  is  infinite  is  given  by 


P(x) 


0,  x  <  0 


2/w 


X  >  0 


1  +  X 
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A1 

Then, 


fr(S)  83  exp(-|{|)  , 

f^)  =  sgntf)  7  {exp<-|*|)  Ei(U|)  -  exp(|?|)  Ei(-|$|)}  . 


Since 


A2 


Ei(-UI)  =  In  Ul  +  C  -  j*|  +  ^  U|2  +  0(|$|3)  asK|  -  0, 
Ei(UI)  =  In  |(|  +  C  +  III  +  “  III  2  +  0(|{| 3)  as  |f|  -  0  , 


there  follows 

^(i)  -  sgn(i)  Jf  1*1  Hn  |{|  +  1  -  C  +  i  U|2)  +  O^U|  3)J 
~  7  *  ^((Tj)  as  Ul  -  0  . 

Therefore, 


f.(£)co„«X)  /lV 

- S - r  “  ISI  ~  °  • 

which  is  unbounded,  but  integrable.  So,  in  those  cases  where  mx  is  infinite, 
the  behavior  of  fi(£)/$  at  the  origin  must  be  handled  carefully  in  order  to 
accurately  evaluate  the  integral.  One  possibility  is  to  subtract  out  the  singu¬ 
larity  and  integrate  it  analytically. 


A2I.  S.  Gradahteyn  and  I.  M.  Ryzhik,  Table  of  Integrals,  Series  and  Products, 
Academic  Presa,  Ne*  York,  1965,  Eq.  (3.723  1 ) . 

A2Ibid.,  Eqa .  (8.2H  1)  and  (8.214  2). 
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Appendix  B 
00 

EVALUATION  OF  ^exp(inx) 
n=l 


Consider  the  ordinary  function 

f(x)  =  Injsin  “|,  x  /  0,  +2r,  +4ir . 


Since  (1  +  x2)-1  f(x)  is  absolutely  integrable  from  -®  to*,  the  generalized 
function  f(x)  corresponding  to  ordinary  function  f(x)  can  be  defined.  B1  in 
fact,  the  generalized  function  f(x)  equals  the  ordinary  function  f(x)  (See  defi¬ 
nition  8  by  LighthiIlB2).  Furthermore,  the  generalized  function  f(x)  is  peri¬ 
odic,  with  period  2t,B3  and,  therefore,  can  be  expressed  asB4 


The  generalized  function  f(x)  is  absolutely  integrable  over  a  period,  since 
the  ordinary  function  f(x)  is  absolutely  integrable  over  a  period.  B5  There¬ 
fore,  the  coefficients  |cn|  in  the  expansion  of  the  generalized  function  f(x) 
are  given  byB6 


c 

n 


21*ZdXln 


-inx 

e 


cos(nx)  *  2 


/ 


1/2 

dt  ln(sinrt) 


cos(2mrt) 


B1 

M.  J.  Lighthill,  An  Introduction  to  Fourier  Analysis  and  Gener«lixed  Function*, 
Cambridge  University  Press,  New  York,  1$59,  p.  21,  definition  7.  — 

B2Ibid,  p.  25. 

B^Ibid,  p.  60,  definition  22. 

^Ibid,  p.  66,  Theorem  26. 

B^Ibid,  p.  48,  definition  19. 

B^Ibid,  p.  66,  Theorem  26,  Note. 
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In2,  n  =  0 


1 

2|n|* 


n  *  0 


> 


where  we  have  used  Eq.  (4.384  3)  by  Gradshteyn  and  Ryzhik.®7  Therefore,  the 
generalized  function  In  j  sin(x/2)|  can  be  expressed  as 


In 


n=— oo 

n  /  0 


If  we  define  the  derivative  of  the  generalized  function  ln|  sin(x/2)|  as  the 
generalized  function  cot(x/2)/2,  differentiation  of  the  last  equation  yields  the 
expression  of  the  generalized  function  cot(x/2)/2  asB8 


T.  sgn(n)  ***  . 


(This  equation  says  that  the  spectrum  of  the  generalized  function  cot(x/2)  is 
the  odd  impulse  train.  B9)  And  since2*0 


5(x  -  n2v) 

n=— oo 


we  obtain 


£  J<x  -  n2')  +  1  2  00t(f)  =  2 

n=-«o  n=i 


mx 


B7 

I.  S.  Gradshteyn  and  I.  M.  Ryihik,  Table  of  Integrals.  Series  and  Products, 
Academic  Press,  New  York,  1965. 

®^0p.  cit.,  M.  J  Lighthill,  p.  28,  Theorem  15. 

®^Ibid.,  p.  66,  Theorem  26,  Eq.  (36). 

®^Ibid.,  p.  67,  Example  38. 
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or 


-  r  + 


-  n2r)  +  i  -  cot 


in  the  sense  of  generalized  functions. 
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Appendix  C 


ALIASED  SPECTRUM 

If  we  define  the  infinite  impulse  train 

*h(t)  «£\(t  -  nh) 
n 

and  use  the  time-limited  character  of  gc,  as  given  in  (42),  it  is  possible  to 
manipulate  (46)  as  follows: 

M 

G  (f)  *  h  w  g  (kh)  exp(-i2rflch) 

C  K  C 

k*0 

T 

-  f  dt  w  (t)  gc(t)  h  afa(t)  exp(-i2rft) 


-  J  dt  w  (t)  gc(t)  h  5^(t)  exp(-i2irft) 


=  3  |w(t)  gc(t)  h  jh(t)|  =  W(f)  •  Gc(f)  •  a1/h(f) 


-  wm  Go(f  -  k  i)  • 
k 

T*  /m 

Using  h  «  — - —  ,  we  see  that  (52)  results. 
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Comparison  of  Four  Fast 
Fourier  Transform 
Algorithms 
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Albert  H.  Nuttall 


ABSTRACT 

Comparisons  of  four  FFT  (Fast  Fourier  Transform) 
algorithms  (Brenner's,  Cooley's,  Fisher's,  and 
Singleton's)  have  been  made  on  the  basis  of  pro¬ 
gram  execution  time,  storage,  and  accuracy. 

Major  modifications  have  been  made  in  the  genera¬ 
tion  of  the  trigonometric  values  in  the  Cooley  and 
Fisher  algorithms,  with  significant  improvements 
in  accuracy.  Entry  of  constants  in  all  algorithms 
has  been  changed:  the  constants  are  approximated 
by  the  best  binary  representation  for  the  UNIVAC 
11Q£  computer.  Three  waveform  examples  are  used 
in  the  comparisons,  namely,  linear  FM,  random 
numbers,  and  a  unit  ramp.  Also,  the  sizes  of  the 
FFT's  considered  are  limited  to  powers  of  2,  from 
16  through  8192. 

The  results  indicate  that  Singleton's  and  Brenner's 
algorithms  have  the  shortest  execution  times  and 
occupy  the  least  amount  of  computer  storage ,  where¬ 
as  Cooley's  and  Fisher’s  algorithms  are  the  most 
accurate.  For  example,  for  an  FFT  of  size  1024 
on  the  linear  FM  waveform,  the  maximum  relative 
errors  for  the  four  algorithms  are  0.17  x  10-6, 

0.63  x  10-7,  o.64  x  10*7,  0.41  x  10-5,  respec¬ 
tively.  Thus,  there  is  no  single  best  algorithm 
for  all  three  criteria  considered;  rather,  each 
algorithm  has  its  own  area  of  most  effective 
applicability. 
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COMPARISON  OF  FOUR  FAST  FOURIER 
TRANSFORM  ALGORITHMS 


1.0  INTRODUCTION 

Since  the  advent  of  the  Fast  Fourier  Transform  (FFT),  several  algorithms, 
each  with  its  own  claim  to  optimality,  have  been  advanced  to  effect  the  Discrete 
Fourier  Transformation.  In  an  effort  to  determine  quantitatively  the  relative  ad¬ 
vantages  and  disadvantages  of  the  various  procedures,  four  algorithms  (Brenner's,1 
Cooley's,1  Fisher's,3  and  Singleton's4)  have  been  selected  for  operational 
comparison  on  the  basis  of  program  execution  time,  storage,  and  accuracy.  The 
comparison  is  restricted  to  FFT  sizes  which  are  powers  of  2,  from  16  through 
8192.  (Cooley's  and  Fisher's  algorithms  are  able  to  handle  powers  of  2  only, 
while  Brenner's  and  Singleton's  can  handle  other  radices. ) 

In  order  to  allow  general  conclusions  (conclusions  not  restricted  to  results 
which  are  waveform-dependent),  three  different  waveform  examples  are  used 
for  the  comparison:  linear  frequency  modulation  (FM),  random  numbers,  and  a 
unit  ramp.  Both  one-way  and  two-way  error  calculations  are  carried  out  for  the 
linear  FM  waveform,  whereas  only  the  two-way  errors  are  calculated  for  the 
random  numbers  and  unit  ramp  waveforms. 

Three  measures  of  error  are  employed:  rms,  average  magnitude,  and  maxi¬ 
mum.  Theoretical  results  on  floating-point  accuracy  are  available  only  for  the 
rms  measure  of  error.'  It  was  deemed  important,  therefore,  to  evaluate  the 
accuracy  of  the  algorithms  for  all  three  error  measures  to  see  if  any  significantly 
different  conclusions  are  obtained. 


2.0  RESULTS 


The  comparison  of  the  four  FFT  algorithms  in  terms  of  execution  time, 
storage,  and  accuracy  is  carried  out  on  the  UNIVAC  1108.  The  forward  FFT 
for  a  complex  sequence  x0,  x ,,...,  x  is  defined  as 


x 


n 


N  -1 


e*p  ( -i  2  rr  on  Ni, 


0  <  o  <  N  -  1 


1 


The  inverse  FFT  is  defined  as 


i 


Xa  e*p  (i  2  n  am'S). 


0  < 


S  -  1 


Some  modifications  to  the  algorithms  have  been  made;  however,  since  these 
modifications  affect  mainly  the  accuracy,  and  not  execution  time  or  storage,  they 
are  discussed  in  detail  in  Section  2.  3,  Accuracy. 


2.  1  EXECUTION  TIME 


Execution  time  is  independent  of  the  particular  waveform  example  employed 
in  the  FFT.  Figure  1  depicts  the  execution  time  of  the  four  algorithms  versus 
the  size  of  the  transform.  The  results  indicate  that  Fisher's  and  Cooley's  algo¬ 
rithms  take  the  most  time;  for  example,  for  an  FFT  size  of  8192,  Fisher’s 
algorithm  takes  3.  75  seconds,  while  Singleton's  algorithm  takes  2.  33  seconds. 
This  significant  difference  in  time  is  somewhat  obscured  in  Fig.  1  by  the  loga¬ 
rithmic  ordinate;  however,  it  is  worth  noting.  The  other  two  algorithms,  for 
size  8192,  require  3. 13  seconds  for  Cooley  and  2.  62  seconds  for  Brenner. 

Since  the  curves  are  virtually  straight  lines  in  Fig.  1,  they  can  be  extrap¬ 
olated  to  powers  of  2  beyond  8192.  However,  one  can  not  interpolate  between 
powers  of  2  to  evaluate  execution  times  for  intermediate  FFT  sizes. 


2.  2  STORAGE  REQUIREMENTS 

Figure  2  depicts  the  number  of  storage  locations  required  for  the  four 
algorithms  as  a  function  of  the  size  of  the  FFT.  The  amount  required  represents 
both  the  number  of  data  storage  locations  and  the  number  of  instructions  that 
the  algorithms  need.  Singleton’s  and  Brenner's  algorithms  need  approximately 
the  same  amount  of  storage,  but  Cooley's  and  Fisher's  algorithms  need  an  in¬ 
creasing  amount  of  storage  as  the  size  of  the  FFT  increases  because  their 
algorithms  store  the  trigonometric  values  and  scratch  storage  in  arrays,  rather 
than  calculate  values  as  needed. 


Again,  extrapolation  to  other  powers  of  2  is  possible,  but  interpolation 
between  powers  of  2  is  not. 
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16  32  64  128  256  512  1024  2048  40<*>  81  <?2 

SIZE  OF  FFT 


Fig.  1.  Forward  FFT  Execution  Times 
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NUMBER  OF  STORAGE  LOCATIONS 


2.3  ACCURACY 


Three  measures  of  error  are  used  in  the  accuracy  comparison:  rms  error, 
average  magnitude  error,  and  maximum  error.  If  the  result  of  a  calculation 
yields  the  sequence  of  complex  numbers  £0,  z,,...,  £N-1  ,  whereas  the  de¬ 
sired  result  is  the  sequence  z0,  z, , . . . ,  z  N_t  ,  the  three  errors  are  defined 
as 


rms  error  = 


average  magnitude  error  = 


n  =  0 


maximum  error  -  max 
n 


The  three  errors  obey  the  rule  that  the  average  magnitude  error  is  never  larger 
than  the  rms  error,  which,  in  turn,  is  never  larger  than  the  maximum  error. 
(The  proof  of  the  first  inequality  follows  from  Schwartz's  inequality. )  Thus,  the 
rms  error  is  an  intermediate  measure  insofar  as  severity  of  error  is  con¬ 
cerned.  The  only  way  any  of  the  error  measures  can  be  equal  is  if  all  the  terms 
1  z  n  -  z  „  are  equal,  i.  e. ,  independent  of  n. 

Modifications  have  been  made  in  all  four  FFT  algorithms  to  improve  their 
accuracy.  These  modifications  include  the  changing  of  constant  values  to  the 
best  binary  representation  for  the  computer,  and  the  generation  of  the  trigono¬ 
metric  values  in  the  Cooley  and  Fisher  algorithms  by  calculating  one  pair  of 
sine  and  cosine  values  in  double  precision,  followed  by  double  precision  recur¬ 
sion,  and  rounding  to  single  precision.  This  procedure  keeps  execution  time  to 
a  minimum  and  improves  the  accuracy  of  the  generated  trigonometric  values, 
which  often  are  the  major  source  of  error  in  FFT  algorithms. 

Three  different  waveforms  are  considered  in  the  error  comparison  in  order 
to  eliminate  any  waveform-dependent  conclusions.  The  first  waveform  is  linear 
FM,  characterized  by  the  sequence 

z  =  exp  (i  rr  m2  /N),  0  <  m  <  N  -  1  (N  even) 

m  «*■> 
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The  FFT  of  this  sequence6  is 

N- 1 

X0  «  exp  (i  n  m2  /N)  exp  (-i  2  irma/N) 

m  *0 

m  N  l/  2  exp  (i  ir/4)  exp  (-i  n n2  /N),  0  Q  <  N  -  1  . 

We  have  here  a  simple  closed-form  theoretical  expression  for  the  one-way  FFT 
that  can  be  used  for  comparison  with  the  numerical  FFT  calculations,  accord¬ 
ing  to  the  error  measures  above.  Figure  3  is  a  flow  chart  for  the  error  calcula¬ 
tion. 

The  results  of  the  rms-error  comparison  on  the  one-way  (forward)  FFT  are 
given  in  Fig.  4,  for  N  ranging  from  16  through  2048,  in  powers  of  2.  *  The 
corresponding  results  for  average  magnitude  error  and  maximum  error  are 
given  in  Figs.  5  and  6,  respectively.  Actually,  all  these  errors  are  relative 
errors,  obtained  by  dividing  the  errors  above  by  the  average  magnitude  of  the 
correct  answer. 

There  is  considerable  similarity  between  the  results  of  Figs.  4,  5,  and  6 
for  the  three  error  measures.  Accordingly,  in  the  remainder  of  this  section 
attention  is  confined  to  the  rms-error  measure.  (Tabulations  of  all  three  errors 
for  all  three  waveforms  are  provided  in  the  appendix  to  this  report. ) 

The  increased  error  of  Singleton's  algorithm  is  strikingly  evident  in  Figs.  4, 
5,  and  6.  It  is  almost  two  orders  of  magnitude  less  accurate  than  the  Cooley  and 
Fisher  algorithms  for  an  FFT  size  at  2048  and  is  degrading  rapidly.  The  Brenner 
algorithm  is  approximately  three  times  less  accurate  at  size  2048  and  has  the 
same  rate  of  error  growth  as  the  Cooley  and  Fisher  algorithms. 

It  is  worthwhile,  at  this  point,  to  compare  the  numerical  investigation  with 
some  theoretical  calculations  of  error  conducted  by  Weinstein/  From  Eqs.  (23) 
and  (24)  of  Weinstein,  we  obtain  the  error  (for  a  one-way  FFT)  as 


where 

o  (  =  0.46  •  2*' 

^Storage  limitations  in  the  auxiliary  error  computation  program  for  the  Cooley  and  Fisher  algorithms 
prevented  us  from  investigating  the  4096  and  8192  cases  for  the  linear  FM  waveform. 
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RMS  ERROR 


Fig.  4.  RMS  Error  for  Linear  FM  (Forward  FFT) 
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Fig.  5.  Average  Magnitude  Error  for  Linear  FM  (Forward  FFT) 


MAXIMUM  ERROR 


Fig.  6.  Maximum  Error  for  Linear  FM  (Forward  FFT) 


Here,  v  is  the  logarithm  (to  the  base  2)  of  the  size  of  the  transform  (N  =  21'), 
and  t  is  the  number  of  bits  used  to  represent  the  mantissa  of  a  number.  * 


As  shown  in  Fig.  4  (where  the  theoretical  equation  is  plotted  as  x's,  with 
o\  =  1),  Weinstein's  calculations  underestimate  therms  errorbyafair  amount 
over  most  of  the  range  of  FFT  sizes.  Also,  his  calculations  indicate  a  slower 
rate  of  error  growth  with  FFT  size  than  was  actually  obtained.  As  Weinstein 
himself  notes,  this  is  probably  due  to  the  truncated  arithmetic  employed  in  the 
UNTVAC  1108.  In  fact,  if  truncated  arithmetic  is  employed  instead  of  rounding, 
the  rms  error  is  greater  by  a  factor  of  V32” at  N  =  2048  (see  Ref.  5).  This  in¬ 
creased  error  would  move  the  theoretical  curve  in  Fig.  4  to  a  very  close  approx¬ 
imation  to  the  Brenner  curve.  Fisher's  and  Cooley's  error  curves  are  somewhat 
better  because  their  trigonometric  values  are  obtained  by  rounding  while  the 
remaining  arithmetic  is  truncation;  thus,  they  constitute  a  mixed  procedure. 


For  the  other  two  waveforms  considered,  the  error  is  computed  after  a  two- 
way  FFT  is  performed;  i.  e. ,  the  FFT  is  retransformed  back  into  the  original 
(time)  domain  to  obtain  the  error  estimate  (see  Fig.  7).  The  primary  reason  for 
doing  this  is  that,  since  all  the  array  entries  in  the  time  domain  are  approximately 
unity  in  magnitude,  it  is  easy  to  form  a  meaningful  relative  error  in  the  time 
domain.  A  relative  error  formed  in  the  frequency  domain,  where  the  range  of 
values  is  several  orders  of  magnitude  for  the  random  numbers  and  unit  ramp 
waveforms,  would  be  less  meaningful.  The  linear  FM  waveform,  on  the  other 
hand,  possesses  constant  magnitude  for  all  array  entries  in  both  domains,  a 
characteristic  which  makes  it  particuJarly  appealing. 


The  results  for  the  rms  error  for  the  three  waveforms  are  given  in  Figs.  S, 

9,  and  10.  (The  average  magnitude  error  and  maximum  error  are  tabulated  in 
the  appendix. )  The  two-way  error  results  are  similar  in  form  to  the  one-way 
error  results,  with  the  exception  of  Singleton's  curve.  A  comparison  of  Figs.  4 
and  8  reveals  that  the  two-way  error  for  Singleton's  algorithm  is  less  than  the 
one-way  error,  a  discrepancy  which  must  be  due  to  fortuitous  error-cancella¬ 
tion  in  the  two-way  results.  Since  one  would  never  use  a  two-way  FFT  without 
performing  some  transformations  on  the  one-way  results,  the  two-way  Singleton 
results  must  be  used  with  reservation.  Where  Singleton's  algorithm  is  concerned, 
it  would  be  more  reasonable  to  double  the  one-way  error  of  Fig.  4  than  to  use 
the  two-way  error  of  Fig.  8. 

*  For  the  UNIVAC  1108,  t  equals  27  in  single  precision. 
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Fig.  7 


Two-Way  FFT  Error  Calculation  for  Random  Numbers  and  Unit  Ramp 
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RMS  ERROR 


SIZE  OF  FFT 

Fig.  8.  RMS  Error  for  Linear  FM  (Inverse  FFT) 
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Fig.  9.  RMS  Error  for  Random  Numbers  (Inverse  FFT) 


or  Unit  Ramp  (Inverse  FFT) 


15 


Direct  comparison  between  errors  for  different  waveforms  is  not  possible 
because  the  average  values  of  the  array  entries  are  not  identical;  e.  g. ,  the  rzns 
value  for  the  linear  FM  waveform  is  1,  for  the  random  numbers  Y5  and  for 
the  unit  ramp  \  2/3.  Such  scale  factors  would  have  to  be  included  in  order  to 
obtain  a  valid  comparison  between  waveforms. 


3.  0  CONCLUSIONS 

The  trade-off  between  the  four  algorithms  considered  is  readily  apparent: 
the  best  accuracy  is  achieved  only  at  the  expense  of  increased  execution  time 
and  storage.  If  we  are  severely  limited  by  execution  time  and  storage,  we  may 
have  to  select  a  less  accurate  FFT  algorithm;  how  important  the  errors  are  will 
depend  upon  the  particular  application. 

In  summary,  no  single  FFT  algorithm  represents  a  best  choice;  it  must  be 
left  to  the  user  to  determine  the  best  algorithm,  based  on  the  criteria  of  most 
importance  to  him. 
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APPENDIX 


TABULATION  OF  ERRORS 


(NOTE:  In  the  following  tables, 
notations  such  as  .  124-07  mean 


.  124  x  10"'.) 


Table  1 


RELATIVE  ERROR  FOR  ONE-WAY  FFT  OF  LINEAR 
FM  WAVEFORM 


Algorithm 

FFT 

Size 

RMS 

Avenge 

Mignitude 

Minimum 

Brenner 

16 

.  124-07 

.951-08 

.199-07 

Cooley 

16 

.800-08 

.811-08 

.142-07 

Filter 

16 

.416-08 

.349-08 

.631-08 

Singleton 

16 

.208-07 

.152-07 

.354-07 

Brenner 

32 

.421-07 

.326-07 

.826-07 

Cooley 

32 

.150-07 

.119-07 

.226-07 

Filter 

32 

.103-07 

.792-08 

.220-07 

Singleton 

32 

.728-07 

.570-07 

.133-06 

Brenner 

64 

.448-07 

.351-07 

.942-07 

Cooley 

64 

.155-07 

.121-07 

.345-07 

Fisher 

64 

.130-07 

.893-08 

.331-07 

Slngletoa 

64 

.136-08 

.101-06 

.311-06 

Brenner 

128 

.612-07 

.518-07 

.132-06 

Cooley 

128 

.197-07 

.171-07 

.386-07 

F  liter 

128 

.198-07 

.160-07 

.384-07 

Singleton 

128 

.175-06 

.128-06 

.444-06 

Brenner 

266 

.586-07 

.494-07 

.118-06 

Cooley 

256 

.213-07 

.181-07 

.431-07 

Fiifaer 

256 

.205-07 

.160-07 

.459-07 

Slngletoa 

256 

.334-06 

;250-06 

.863-06 

Brenner 

512 

.765-07 

.682-07 

.178-06 

Cooley 

512 

.264-07 

.239-07 

.549-07 

Fuller 

512 

.279-07 

.242-07 

.593-07 

Singleton 

512 

.665-06 

.520-06 

.  189-05 

Brenner 

1024 

.806-07 

.718-07 

.173-06 

Cooley 

1024 

.283-07 

.249-07 

.626-07 

Fiiber 

1024 

.271-07 

.228-07 

.642-07 

Singleton 

1024 

.126-05 

.968-06 

,355-05 

Brenner 

2048 

.969-07 

.890-07 

.203-06 

Cooley 

2048 

.349-07 

.321-07 

.654-07 

F liber 

2048 

.344-07 

.311-07 

.828-07 

Slngletoa 
— _ _ 

2048 

.235-05 

.176-05 

.755-05  j 

Table  2 


RELATIVE  ERROR  FOR  TWO-WAY  FFT  OF  LINEAR 
FM  WAVEFORM 


Algorithm 

FFT 

Size 

RMS 

Average 

Magnitude 

Maximum 

Brenner 

16 

.219-07 

.197-07 

.395-07 

Cooley 

16 

.684-08 

.590-08 

.159-01 

Fi&er 

16 

.543-08 

.373-08 

.931-08 

Singleton 

16 

.124-07 

.843-08 

.218-07 

Brenner 

32 

.800-07 

.628-07 

.159-06 

Cooley 

32 

.230-07 

.177-07 

.401-07 

Fither 

32 

.204-07 

.169-07 

.333-07 

Singleton 

32 

.425-07 

.280-07 

.897-07 

Brenner 

64 

.828-01 

.659-07 

.157-06 

Cooley 

64 

.237-07 

.180-07 

.525-07 

F  uber 

64 

.222-07 

.165-07 

.449-07 

Singleton 

64 

.316-07 

.289-07 

.802-07 

Brenner 

128 

.108-06 

.931-07 

.211-06 

Cooley 

128 

.344-07 

.291-07 

.619-07 

Fiiber 

128 

.381-07 

.326-07 

.673-07 

Singleton 

128 

.678-07 

.562-07 

.157-06 

Brenner 

256 

.115-06 

.980-07 

.214-06 

Cooley 

256 

.380-07 

.324-07 

.760-07 

Fiiber 

256 

.383-07 

.315-07 

.792-07 

Singleton 

256 

.898-07 

.685-07 

.242-06 

Brenner 

512 

.142-06 

.127-06 

.306  -  06 

Cooley 

512 

.510-07 

.462-07 

.954-07 

Fiiber 

512 

.539-07 

.484-07 

.989-07 

Singleton 

512 

.227-06 

.178-06 

.696-06 

Brenner 

1024 

.150-06 

.135-06 

.308-06 

Cooley 

1024 

.544-07 

.481-07 

.934-06 

Fiiber 

1024 

.520-07 

.461-07 

.105-06 

Singleton 

1024 

.294-06 

.211-06 

.126-05 

Brenner 

2048 

.181-06 

.166-06 

.344-06 

Cooley 

2048 

.683-07 

.623-07 

.127-06 

Fiiber 

2048 

.688-07 

.638-07 

.119-06 

Singleton 

2048 

.557  -  06 

.397  -  06 

.289-05 
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Table  3 


RELATIVE  ERROR  FOR  TWO-WAY  FFT  OF 
RANDOM  NUMBERS 


Algorithm 

FFT 

Size 

RMS 

Average 

Magnitude 

Maximum 

Brenner 

16 

.499-07 

.359-07 

.141-06 

Cooley 

16 

.221-07 

.165-07 

.632-07 

Fisher 

16 

.220-07 

.153-07 

.632-07 

Singleton 

16 

.276-07 

.209-07 

.802-07 

Brenner 

32 

.787-07 

.612-07 

.224-06 

Cooley 

32 

.369-07 

.308-07 

.954-07 

Fisher 

32 

.332-07 

.267-07 

.666-07 

Singleton 

32 

.543-07 

.458-07 

.114-06 

Brenner 

64 

.105-06 

.838-07 

.256-06 

Cooley 

64 

.464-07 

.366-07 

.128-06 

Fisher 

64 

.516-07 

.403-07 

.128-06 

Singleton 

64 

.966-07 

.820-07 

.187-06 

Brenner 

128 

. 166-06 

.132-06 

.592-06 

Cooley 

128 

.733-07 

.571-07 

.253-06 

Fisher 

128 

.719-07 

.603-07 

.211-06 

Singleton 

128 

.223-06 

.178-06 

.716-06 

1 

Brenner 

256 

.184-06 

.148-06 

.590-06  | 

Cooley 

256 

.881-07 

.711-07 

.298-06 

Fisher 

256 

.862-07 

.709-07 

.233-06 

Singleton 

256 

.326-06 

.261-06 

.943-06  j 

Brenner 

512 

.215-06 

.172-06 

.718-06 

Cooley 

512 

.102-06 

.813-07 

.382-06 

Fisher 

512 

.101-06 

.825-07 

.340-06 

Singleton 

512 

.737-06 

.602-06 

.259-05 

Brenner 

1024 

.255-06 

.204-06 

.894-06 

Cooley 

1024 

.123-06 

.992-07 

.443-06 

Fisher 

1024 

.124-06 

.102-06 

.424-06 

Singleton 

1024 

.152-05 

.124-05 

.524-05 

Brenner 

2048 

.288  -  06 

.230-06 

. 110-05 

Cooley 

2048 

.136-06 

. 109-06 

.506-06 

Flihet 

2048 

.137-06 

.111-06 

.554-06 

Singleton 

2048 

.317-05 

.257-05 

. 128-04 
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Table  3  (Cont'd) 

RELATIVE  ERROR  FOR  TWO-WAY  FFT  OF 
RANDOM  NUMBERS 


Algorithm 

FFT 

Size 

RMS 

Avenge 

Magnitude 

Maximum 

Brenner 

4096 

.306-06 

.245-06 

.117-05 

Cooley 

4096 

.149-06 

.119-06 

.669-06 

F  liber 

4096 

.150-06 

.122-06 

.654-06 

Single  too 

4096 

.619-05 

.502-05 

.275-04 

Bream 

8192 

.339-06 

.272-06 

.154-05 

Cooley 

8192 

.163-06 

.131-06 

.654-06 

F  liber 

8192 

.165-06 

.133-06 

.674-06 

Singleton 

8192 

.122-04 

.991-05 

.563-04 

Table  4 

RELATIVE  ERROR  FOR  TWO-WAY  FFT  OF  UNIT  RAMP 


Algorithm 

FFT 

Sise 

RMS 

Average 

Magnitude 

Maximum 

Brenner 

IS 

.250-07 

.188-07 

.537-07 

Cooley 

16 

.774-08 

.492-08 

.211-07 

Fiiher 

16 

.101-07 

.889-08 

.211-07 

Singleton 

16 

.121-07 

.833-08 

.239-07 

Brenner 

32 

.361-07 

.305-07 

1.000-07 

Cooley 

32 

.129-07 

. 102-07 

.421-07 

Fiiher 

32 

.172-07 

.142-07 

.421-07 

Singletoa 

32 

.213-07 

.183-07 

.421-07 

Brenner 

64 

.411-07 

.337-07 

.120-06 

Cooley 

64 

.180-07 

.141-07 

.632-07 

Fiiher 

64 

.206-07 

.171-07 

.477-07 

Singleton 

64 

.367-07 

.296-07 

.107-06 

Brenner 

128 

.529-07 

.449-07 

.180-06 

Cooley 

128 

.238-07 

.181-07 

.954-07 

Fiiher 

128 

.246-07 

.195-07 

.745-07 

Singletoa 

128 

.702-07 

.564-07 

.249-06 

Brenner 

256 

.575-07 

.483-07 

.208-06  j 

Cooley 

256 

.284-07 

.222-07 

.116-06 

Fiiher 

256 

.298-07 

.241-07 

.105-06 

Singleton 

256 

.107-06 

.820-07 

.463-06 

Brenner 

512 

.696-07 

.593-07 

.283-06  | 

Cooley 

512 

.331-07 

.255-07 

.149-06 

Fitber 

512 

.342-07 

.273-07 

.  126-06 

Singleton 

512 

.236-06 

.178-06 

.119-05 

Brenner 

1024 

.743-07 

.628-07 

.316-06 

Cooley 

1024 

.366-07 

.279-07 

.191-06 

Fisbet 

1024 

.396-07 

.319-07 

.158-06 

Singleton 

1024 

.440-06 

.328-06 

.270-06 

Table  4  (Coat'd) 

RELATIVE  ERROR  FOR  TWO-WAY  FFT  OF  UNIT  RAMP 


Algorithm 

FFT 

Sine 

RMS 

Average 

Magnitude 

Maximum 

Bread  cr 

2048 

.822*01 

.150-01 

.401-06 

Cooley 

2048 

.412-01 

.315-01 

.233-06 

FUhcr 

2048 

.438-01 

.350-01 

.191-06 

Singleton 

2048 

.936-06 

.694-06 

.642-06 

Brenner 

4096 

.899-01 

.160-01 

.434-06 

Cooley 

4096 

.464-01 

.359-01 

.211-06 

FUhcr 

4096 

.491-01 

.395-01 

.244-06 

Singleton 

4096 

.  185-05 

.131-05 

.144-04 

Brenner 

8192 

.101-06 

.810-01 

.511-06 

Cooley 

8192 

.502-01 

.384-01 

.310-06 

Fuber 

6192 

.530-01 

.422-01 

.265-06 

Singleton 

3192 

.310-05 

.213-05 

.328-04 
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Spectral  Estimation  by 
Means  of  Overlapped  Fast 
Fourier  Transform 
Processing  of  Windowed 

Data 

Albert  H.  Nuttall 
ABSTRACT 


An  investigation  of  power-densi ty  autospectnan estimation 
by  means  of  overlapped  Fast  Fourier  Transform  (FFT) 
processing  of  windowed  data  is  conducted  for  four  candidate 
spectral  windows  with  good  side-lobe  behavior.  A  compari¬ 
son  of  the  four  spectral  windows  is  made  on  the  basis  of 
equal  half-power  resolution  bandwidths.  The  criteria  for 
comparison  are:  (1)  statistical  stabi 1 i ty  of  the  spectral 
estimates,  (2)  leakage  (side  lobes)  of  the  spectral  windows, 

(3)  number  of  FFTs  (number  of  overlapped  pieces)  required, 
and  (4)  size  of  each  FFT  required.  The  dependence  of  these 
criteria  on  the  amount  of  overlap  is  investigated  quantitatively. 

Some  striking  Invariances  are  discovered.  Specifically,  it 
Is  shown  that  the  ultimate  variance-reduction  capabilities 
of  the  four  windows ,  as  measured  by  the  equivalent  number  of 
degrees  of  freedom  (EDF),  are  virtually  identical  under  the 
constraint  of  equal  half-power  bandwidths.  Furthermore,  when 
the  proper  overlap  is  used  for  each  window,  the  stability  of 
this  method  of  spectral  estimation  is  identical  to  that  of  the 
"indirect”  correlation  approach.  Also,  the  number  of  FFTs 
requi red  to  real Ize  99  percent  (or  less)  of  the  maximum  EDF 
is  virtually  indeoendent  of  the  particular  window  employed. 

The  required  fractional  overlap  of  the  four  data  windows  for 
99  percent  (or  less)  of  the  maximum  EDF  is  virtual ly  Independ¬ 
ent  of  the  product  of  the  available  time  and  the  resolution 
bandwidth,  although  it  does  depend  on  the  particular  window. 
Tables  of  required  overlap  are  presented.  The  only  tradeoff 
among  the  four  windows  is  that  those  with  better  side  lobes 
require  larger-size  FFTs.  All  of  these  results  are  derived 
for  a  Gaussian  random  process,  under  the  assumption  that  the 
resolution  bandwidth  of  the  spectral  window  is  smaller  than 
the  finest  detail  In  the  true  spectrun. 

Rules  of  thimb  for  the  maximum  EDF  and  the  number  of  FFTs 
required  to  realize  99  percent  of  the  maximum  EDF  are  given. 

The  possibility  of  weighting  individual  spectral  estimates 
unequal ly  in  order  to  optimize  the  EDF  is  Investigated;  the 
gain  is  found  to  be  negl igible  for  cases  of  practical  interest. 


Approved  lor  public  release;  distribution  unlimited. 
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SPECTRAL  ESTIMATION  BY  MEANS  OF  OVERLAPPED 
FAST  FOURIER  TRANSFORM  PROCESSING  OF  WINDOWED  DATA 


INTRODUCTION 

Estimation  of  the  power-density  spectra  of  stationary  random  processes  is 
an  important  problem  and  occurs  frequently  in  many  fields.  The  resolution  of 
closely  spaced  frequency  components,  with  limited  amounts  of  data,  presents 
inherent  limitations  on  the  statistical  stability  of  the  estimates.  Also,  the  pre¬ 
vention  of  leakage  of  undesired  frequency  components  into  the  frequency  being 
analyzed  dictates  a  careful  choice  of  data  weighting.  Lastly,  the  extent  and 
complexity  of  the  data  processing  required  to  realize  the  desired  resolution, 
stability,  and  leakage  control  are  important  considerations. 

The  fundamental,  conflicting  desires  involved  in  spectral  estimation  be¬ 
come  painfully  obvious  when  the  amount  of  data  available  for  analysis  is  limited 
and  can  not  be  augmented  by  additional  measurements.  For  example,  the  avail¬ 
able  record  length  may  be  limited  by 

a.  nonstationary  conditions  (changing  environment), 

b.  storage  limitations. 

c.  equipment  failure,  and 

d.  time-sharing  requirements. 

.Although  factors  b,  c,  and  d  can  often  be  remedied  or  corrected,  factor  a 
often  can  not  be  controlled.  Thus,  only  a  small  segment  of  the  time  record 
may  be  usable  for  each  spectral  analysis.  If  fine  frequency  resolution  is  de¬ 
sired.  the  limited  number  of  independent  observations  available  makes  stable 
estimation  impossible  in  some  cases.  One  must  then  be  willing  to  accept 
coarser,  but  more  stable,  spectral  estimates. 

The  two  fundamental  parameters  that  control  the  performance  of  spectral 
estimation  are  the  available  record  length,  T.  in  which  the  sample  of  the 
random  process  is  assumed  stationary,  and  the  desired  frequency  resolution, 

B.  of  the  spectral  analysis.  Large  values  of  the  fundamental  BT  product 
yield  good  performance  of  the  analysis  technique,  but  small  values  are  often 
forced  upon  us  by  too  small  a  record  length  T  or  too  fine  a  desired  resolution 
B.  The  problem  here  is  to  make  maximum  use  of  the  available  data. 
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Spectral  analysis  has  received  much  attention  in  the  past  [1-6],  especially 
since  the  advent  of  the  Fast  Fourier  Transform  (FFT)  [7,8].  In  particular,  the 
method  of  averaging  short  modified  periodograms  [9]  is  a  prime  candidate  for 
spectral  analysis  —  for  several  reasons.  First  of  all,  nonstationary  trends  in 
the  data  are  more  readily  observable  through  the  time-local  spectral  estimates 
of  each  segment.  Second,  the  size  of  each  FFT  can  be  kept  reasonably  small, 
thereby  reducing  storage,  execution  time,  and  round-off  error.  Third,  the  fre¬ 
quency  resolution  is  easily  controlled  by  the  choice  of  segment  length,  and  leak¬ 
age  (side  lobes)  can  be  controlled  by  the  proper  choice  of  window  in  the  time 
domain.  Lastly,  overlapped  segments  of  windowed  data  utilize  more  fully  the 
variance-reduction  capability  of  a  given  record  length. 

The  problem  to  be  addressed  here  has  to  do  with  the  choice  of  window  and 
amount  of  overlap  to  employ  for  a  particular  application.  Specifically,  if  we 
employ  a  window  with  very  small  spectral  side  lobes,  how  much  should  the 
segments  be  overlapped,  and  does  the  overlap  vary  greatly  with  the  particular 
window  selected?  (Fifty-percent  overlap  has  been  suggested  as  a  reasonable 
procedure  for  the  triangular  data  window  [9.  p.  72].)  How  many  FFTs  of  what 
size  have  to  be  performed  for  the  different  windows?  Is  the  variance-reduction 
capability  dependent  on  the  particular  window? 

Four  windows  will  be  investigated.  They  are  called  data  windows  in  the 
time  domain,  where  they  are  multiplied  by  the  available  data  record,  they  will 
be  called  spectral  windows  in  the  frequency  domain,  where  their  main  effect 
enters  via  convolution.  The  four  data  windows  are  called  triangular,  cosine.  * 
quadratic,  and  cubic  and  will  be  documented  in  a  later  section.  Hamming 
weighting  [2.  p.  14],  although  it  possesses  good  adjacent  side  lobes,  is  not 
considered  here  because  the  spectral  window  decays  very  slowly  with  frequency, 
thereby  responding  to  frequencies  far  removed  from  the  analysis  band  of 
interest. 


PROBLEM  DEFINITION 

Consider  that  stationary  random  process  x  has  been  observed  for  a  time 
interval  T  seconds,  that  is,  x(t)  for  0<  t  <  T  is  available.  Let  the  power- 
density  autospectrum  of  this  process  at  frequency  f  be  denoted  by  G(f),  where 
double-sided  spectral  notation  will  be  employed.  We  wish  to  estimate  spec¬ 
trum  G  with  a  resolution  of  B  hertz,  where  B  is  the  half-power  (-3  dB> 
bandwidth  of  the  desired  resolution. 


•Also  called  Hanning  weighting. 
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DATA  WINDOW  w 


0  S  L  S-L  fP-l)S  (P-1  )S*  L 


Fig.  1.  Overlapped  Data  Windows 

The  method  of  obtaining  the  spectral  estimates  is  depicted  in  Fig.  1.  A 
data  window  w  of  duration  L  seconds  is  applied  successively  to  the  available 

data  x  in  the  overlapping  intervals  (0,L),  (S.S+L) .  ((P-l)S,  (P-1)S+L) 

S  is  the  amount  of  shift  each  adjacent  data  window  undergoes,  and  P  is  the 
total  number  of  pieces  or  segments  employed.  Since  only  T  seconds  of  data 
are  available,  we  must  have 


(P-1)S+L  <  T  .  (1) 

The  segment  length  L  should  be  large  enough  that  the  correlation  function  of 
process  x  is  effectively  zero  for  delays  larger  than  L/2.  (The  relation  be¬ 
tween  frequency  resolution  B  and  segment  length  L  is  discussed  quantitatively 
later.)  The  form  of  the  data  window,  depicted  in  Fig.  2,  is  even  about  the 
origin  and  real.  Also,  w(t)  is  zero  for  iti>L/2. 


w(») 
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When  the  overlap  in  Fig.  1  is  a  significant  fraction  of  the  segment  length, 
the  effective  use  of  the  available  data  x  is  fairly  uniform  over  the  entire  inter¬ 
val  T  except  for  the  edges  of  the  data,  where  a  gradual  taper  over  an  interval 
of  length  L/2  takes  place.  This  is  consistent  with  earlier  suggestions  [lO, 
p.  58]  for  maximum  use  of  the  available  data.  Notice  that  the  percentage  of 
taper  depends  on  the  desired  frequency  resolution  and  available  record  length, 
and  is  not  a  constant,  such  as  10  percent,  as  has  occasionally  been  suggested. 

The  estimate  of  the  power-density  spectrum  G  is  obtained  as  follows. 
First,  a  Fourier  transform  on  the  p-th  windowed  section  is  performed*: 

Yp(f)  =fdt  exp(-i2irft)  x(t)  w^t  -  —  -  (p-l>sj  ,  1  <  p  <  P  .  (2) 

The  spectral  estimate  at  frequency  f  is  then  available  as  the  average  of  the  P 
pieces: 


Equation  (2)  assumes  a  continuous,  rather  than  discrete,  form  of  signal 
processing  [2,  Secs.  4-11  versus  12-2l].  However,  if  the  discrete  version  of 
Eq.  (2),  where  samples  of  a  are  taken  At  seconds  apart,  is  such  that  aliasing 
is  negligible,  there  is  little  difference  between  the  two  methods  of  spectral 
analysis  [2,  pp.  37-39  and  123-125].  We  shall  assume  that  At  is  so  chosen 
and  confine  attention  here  to  the  continuous  processing  technique  of  Eq.  (2).  Of 
course,  in  practice,  Eq.  (2)  is  approximated  by  a  discrete  Fourier  transform 
[9,  p.  70] ,  in  which  case  dc  and  linear-trend  removal  should  be  considered  for 
the  sampled  data  [2,  pp.  4  7-49]. 


The  spectral  estimate  G(f)  in  Eq.  (3)  is  a  random  variable.  Its  mean  and 
variance  are  evaluated  in  Appendix  A  under  the  assumptions  that  x  is  a  Gauss¬ 
ian  random  process  and  that  the  frequency  resolution  of  the  spectral  window 
|  W|  “ ,  whe  re 


■/- 


exp(-i2;rft)  w(t)  , 


is  narrower  than  the  finest  detail  in  the  true  spectrum  G.  (This  latter  assump¬ 
tion  is  equivalent  to  that  given  under  Eq.  (1)  for  segment  length  L.)  The  re¬ 
sults  are 


^Integrals  without  limits  are  over  the  range  of  non-zero  integrand. 


Ej6(f)}  =/d,  G(f- »>)  |W (^) |2  ~  G(f) fdv  |W(^)|2  , 

j  G(f)|  2?  £  T  (l  -  i|)|  fdn  exp(i2rMkS)  G(m)  |w(f-M)|S 
k=-(P-l)  ' 


2G!|^k|,,(‘^)W!' 


where 


0^(r)  = J ~ dt  w(t)  w*(t  -  r)  . 


Relation  (5)  shows  that  the  mean  of  the  spectral  estimate  is  equal  to  the  con¬ 
volution  of  the  true  spectrum  G  with  the  spectral  window  |W|^.  Relation  (6) 
expresses  the  variance  of  the  spectral  estimate  in  terms  of  the  number  of 
pieces  P,  the  shift  S,  and  the  autocorrelation  ©w  of  the  data  window.  The 
result,  Eq.  (6),  holds  if  f  is  greater  than  the  bandwidth  of  the  spectral  win¬ 
dow;  the  right  side  of  Eq.  (6)  must  be  doubled  if  f  =  0  [see  also  9,  p.  7l]  . 

The  equivalent  number  of  degrees  of  freedom  (EDF)  in  spectral  estimate 
G  is  defined  as  [2,  p.  22] 


E2  I'Gjf)} 
Var  {'em} 


K  =  2 


P-1  /  v 

Z  (‘4') 


k  =  -(P-l) 


©  (0) 
w 


employing  Eqs.  (4)  through  (6).  Notice  that  under  the  assumptions  given  above, 
K  is  independent  of  the  value  of  frequency  f  and  true  spectrum  G,  for  f  =  0, 
K  is  given  by  one  half  of  Eq.  (8). 

For  computational  purposes,  it  is  convenient  to  define  a  normalized  data 
window  u  according  to 


u(x)  =  w(Lx)  . 


Then  u  contains  the  shape  information  of  data  window  w  but  extends  only  over 
the  interval  (-1/2,  1/2).  It  follows  that 


where 


• Jr)  =  L  *  (r/ L)  , 

w  u 


dt  u(t)  u*(t-  t)  . 


(10) 

(11) 


The  EDF  in  Eq.  (8)  then  becomes 


K  =  • 


2P 


P-1 

£ 

k=-(P-l) 


(■-40 


<t>  (kS/L) 
u 


♦u<°> 


(12) 


In  order  to  minimize  the  fluctuations  in  the  spectral  estimate  G,  we 
should  maximize  K  in  Eq.  (12).  To  accomplish  this  for  a  given  number  of 
pieces  P  and  segment  length  L,  the  shift  S  in  Eq.  (12)  should  be  chosen  as 
large  as  possible  so  that  <<>u  is  as  small  as  possible  at  S/L,  2S/L,  etc. 
However,  since  Eq.  (1)  dictates  a  constraint  among  these  variables,  the  best 
choice  of  shift  S  —  for  given  record  length  T,  number  of  pieces  P,  and 
segment  length  L  —  is  given  by  equality  in  Eq.  (1): 


S  =  (for  P  >  2)  . 

Substituting  Eq.  (13)  in  Eq.  (12),  we  have 


(13) 


<  •  x presses  the  EDF  as  a  function  of 


P,  number  of  pieces  in  the  average, 

T/L,  ratio  of  record  length  to  segment  length,  and 
<*>^,  autocorrelation  of  shape  of  data  window. 

The  problem  now  is  to  maximize  the  EDF  in  Eq.  (14)  by  choosing,  subject  to 
specified  record  length  T  and  desired  half-power  frequency  resolution  B,  the 
number  of  pieces,  P,  for  several  data  windows,  w.  It  will  turn  out  that  the 
optimum  value  for  P  is  not  infinite. 

One  special  case  of  Eq.  (14)  is  worth  comment:  if  P  <  T/L,  the  values 
of  d>u  in  Eq.  (14)  are  zero  since  u  extends  only  over  (-1/2,  l/2).  Then  K 
equals  2P;  that  is,  regardless  of  the  window,  the  EDF  increases  linearly 
with  the  number  of  pieces  P  until  overlap  occurs  (see  Eq.  (13)).  As  P  in¬ 
creases  somewhat  beyond  T/L,  K  continues  to  increase,  although  at  a  slower 
rate,  because  the  overlapped  pieces  are  progressively  more  statistically  de¬ 
pendent.  Of  importance  in  the  behavior  of  K  are  the  rate  of  increase  of  K 
with  P,  and  the  maximum  value  of  K  attainable  through  the  choice  of  P. 

LIMITING  VALUE  OF  EQUIVALENT  NUMBER  OF  DEGREES  OF  FREEDOM 

As  the  number  of  pieces  P  tends  to  infinity,  the  overlap  approaches  100 
percent,  and  the  denominator  of  Eq.  (14)  approaches  an  integral,  yielding 
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This  limiting  value  depends  only  on  T/L  and  the  correlation  <t>  of1 

u  of  the  window.  It  is  finite  because  the  overlapped  pieces  are  statistically 

dependent. 

For  large  values  of  T/L  (ratio  of  record  length  to  segment  length),  an 
alternate  form  of  Eq.  (15)  is  very  illuminating.  If  Eq.  (10)  is  utilized,  the 
denominator  approaches 


<t>  (Lr) 
w 


0  (0) 
w 


<t>  (t) 
w 

0  (0) 
w 


but  from  Eqs.  (7)  and  (4), 


and 


giving 


<*>w(0) -f It  |w(t)|2  =/df  |W(f)|2  . 
«w(t)  =y*df  exp(i27rft)  j  W(f)|2  , 

/ dt  kw(t)|2  =  /df  |W(f)|4 


(16) 


(17) 

(18) 


(19) 


by  Parseval's  Theorem.  Combining  Eqs.  (15)  through  (19),  we  obtain 


If  we  define  the  statistical  bandwidth  [5,  p.  265]  of  spectral  window  |w|“  as 


then 


(21) 


K  5  2  (T  -  L)  B 

uo  St 


(22) 
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The  constant  Cst  is  dimensionless  and  of  the  order  of  unity;  it  depends  only  on 
the  shape  of  the  window. 


where  U  is  the  Fourier  transform  of  u  (see  also  Eq.  (9)). 

The  first  form  in  Eq.  (22)  for  indicates  that  if  windows  are  compared 
on  the  basis  of  equal  statistical  bandwidths  (by  appropriate  choice  of  segment 
length  L  for  each  data  window),  then  all  windows  have  the  same  value  of 
for  large  T/L;  that  is,  all  windows  have  the  same  variance-reduction  capabil¬ 
ity,  when  compared  on  the  basis  of  equal  statistical  bandwidths,  if  the  avail¬ 
able  record  length  is  much  larger  than  the  segment  length.  For  other  measures 
of  bandwidth,  such  as  the  half-power  bandwidth  B,  we  are  led  to  anticipate 
this  same  result.  In  a  later  section,  we  will  demonstrate  this  quantitatively  not 
only  for  T/L  »  1  but  for  small  values  of  this  ratio  as  well ,  and  for  finite 
values  of  P,  the  number  of  pieces. 


It  is  of  interest  to  compare  Eq.  (20)  with  the  results  of  Blackman  and  Tukey 
[2,  Secs.  B6-B8]  for  spectral  estimation  via  the  ’’indirect"  correlation  func¬ 
tion  approach.  Their  EDF  at  frequency  fy  is  given  approximately  by 


2T 


r  r° 

/df  [Q  (f+f  )  +Q.(f-f1)]  G(f) 

-Jo, 


/ 


df  [Q.(M1)  +Qi(f-f1)]2  G2(f) 


(24) 


for  long  records,  where  Qi  is  their  spectral  window.  For  frequencies  f^ 
greater  than  the  width  of  spectral  window  Q^,  and  assuming  that  Qi  is  narrow 
compared  with  the  finest  detail  in  true  spectrum  G,  Eq.  (24)  becomes  approxi¬ 
mately 


2T 


QO 

df  Q2(f-f1> 


2 


(25) 


9 


In  order  to  relate  the  EDF  in  Eq.  (25)  to  the  one  used  here,  we  note  that  the 
mean  value  of  Blackman  and  Tukey's  spectral  estimate  is  given  by  the  convolu¬ 
tion  of  Qi  with  G.  We  then  identify  with  |w|2,  obtaining  for  the  EDF  in 
Eq.  (25): 

\  fdi  |W(f)|2l 

2T  - ’  (26) 

f  df|W(f)|4 
-  00 

which  is  in  agreement  with  Eq.  (20)  for  long  records.  Thus,  under  the  assump¬ 
tions  given  above,  the  same  limiting  value  of  EDF  is  realized  by  both  the 
"indirect”  correlation  approach  and  the  present  "direct”  FFT  approach;  that  is, 
both  methods  are  capable  of  the  same  statistical  stability  if  the  proper  overlap 
is  used  in  the  FFT  approach. 


DATA  WINDOWS  AND  CHARACTERISTICS 

Four  data  windows  will  be  considered  here.  They  are  all  continuous;  how¬ 
ever,  they  have  differing  degrees  of  continuity  in  their  derivatives,  leading  to 
different  rates  of  decay  of  their  spectral  windows  for  large  frequencies.  The 
triangular  data  window,  made  up  of  two  straight -line  segments,  has  a  discon¬ 
tinuous  first  derivative.  The  cosine  data  window  (Hanning)  has  a  discontinuous 
second  derivative.  The  quadratic  data  window  is  made  up  of  segments  of 
quadratic  curves  so  chosen  that  the  first  derivative  is  everywhere  continuous, 
but  the  second  derivative  is  discontinuous;  thus,  the  quadratic  data  window  has 
behavior  similar  to  that  of  the  cosine  data  window.  The  cubic  data  window  is 
made  up  of  segments  of  cubic  curves  30  chosen  that  the  second  derivative  is 
everywhere  continuous,  but  the  third  derivative  is  discontinuous.  This  se¬ 
quence  of  windows  will  have  progressively  better  high-frequency  decay  and,  as 
will  be  seen  shortly,  better  side  lobes  at  low  frequencies.  (Hamming  weighting 
is  not  considered  because  its  discontinuous  data  window  yields  a  slowly  decay¬ 
ing  spectral  window  for  large  frequencies.)  Computation  of  the  quadratic 
and  cubic  data  windows  is  easier  and  faster  than  computation  of  the  cosine  data 
window  in  the  time  domain.  *  Their  computational  advantage  and  better  side- 
lobe  behavior,  make  them  attractive  candidates  for  spectral  analysis. 


*ln  the  frequency  domain,  the  cosine  data  window  is  equivalent  to  convolu¬ 
tion  with  the  sequence  -1/4,  1/2,  -1/4,  which  is  easily  implemented. 
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The  four  windows  are  detailed  below  in  Eqs.  (2  7)  through  (30).  They  have 
been  normalized  in  such  a  way  that  U(0)  =  1.  From  Eq.  (9)  and  Fig.  1,  notice 
that  u(t)  =  0  for  iti  >  1/2.  (The  expressions  for  the  correlations  $u  of  the 
windows  are  collected  in  Appendix  B;  these  correlations  are  necessary  for  the 
evaluation  of  Eq.  (14).)  In  the  following,  sine (x)  =  sin (rx)/(rx). 


Triangular  Data  Window 

u(t)»  2  (1 -2lti),  itl<^ 

U(f)  =  sinc2(f/2)  .  (2  7) 


Cosine  Data  Window 


u(t)  =  1  +  cos(2jrt),  1 1 1  <  — 


u<f)  =  ^££L  . 

i  -  r 


(28) 


Quadratic  Data  Window 


u(t)  = 


f(i-12,2),  ,t,<i 

— —  (l  -  21, ])2, 


U(f)  =  sinc2(f/3)  . 


(29) 
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Cubic  Data  Window 


u(t)  = 


8  /  2  3\  1 

“(l  -  24t  +  48  It  I  ),  Itl  <  ~ 


U(f)  =  sine  (f/4)  . 


(30) 


The  function  W(f)  is  available  from  the  normalized  function  U(f)  accord¬ 
ing  to 


W(f)  =  L  U(Lf)  ,  (31) 

upon  Fourier  transformation  of  Eq.  (9).  We  define  the  half-power  bandwidth 
of  spectral  window  |W|2  as  the  frequency  range  over  which  |W|2  is  greater 
than  half  of  its  peak  value: 


K±iB)|24|w(0)|2.  (32) 

In  addition  to  the  statistical-bandwidth  constant  Cst  defined  in  Eq.  (21),  we 
define  a  half-power-bandwidth  constant  C  according  to 

B  =  ~  .  (33) 

Numerical  values  for  both  of  these  dimensionless  constants  for  the  four  win¬ 
dows  are  given  in  Table  1. 

The  bandwidth  constants  are  larger  for  the  "smoother"  data  windows;  thus, 
their  bandwidths  are  larger  for  a  given  segment  length  L  (Fig.  1).  Alternatively, 
if  the  bandwidths  are  to  be  kept  equal  for  the  four  windows,  the  segment  lengths 
must  be  larger  for  the  smoother  data  windows. 

In  the  bottom  row  of  Table  1,  the  ratio  of  the  statistical-bandwidth  constant 
to  the  half-power-bandwidth  constant  is  found  to  be  relatively  constant  for  the 
four  windows  considered.  Thus,  the  statistical  stability  of  the  spectral  esti- 
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Table  1 


BANDWIDTH  CONSTANTS 


Data  Window 

Triangular 

Cosine 

Quadratic 

Cubic 

Cst 

1.854 

2.079 

2.304 

2.  686 

c 

1.276 

1.441 

1.572 

1.820 

cst/c 

1.454 

_ 

1.443 

1.466 

1.476 

mates  can  be  discussed  in  terms  of  either  bandwidth  without  fear  of  changing 
significantly  the  quantitative  aspects .  For  example,  Eq.  (22)  becomes 

Ka)a2.9(T-L)B,  for  £  »  1 ,  (34) 

in  terms  of  half-power  bandwidth  B,  where  Eqs.  (21)  and  (33)  and  Table  1 
have  been  employed.  A  more  precise  relation  than  Eq.  (34)  will  be  given  for 
K  in  the  next  section,  where  the  number  of  pieces  P  will  be  finite. 

Half  of  each  symmetric  spectral  window  is  plotted  in  dB  versus  f/B  in 
Figs.  3  through  6.  Here 

dB  =  10  log10  |U|2  ,  (35) 

since  the  power -density  spectrum  G  is  seen  through  a  window  proportional 
to  | U 1  ^ .  All  the  plots  go  through  -3.01  dB  at  f/B  =  1/2,  since  B  is  the  half¬ 
power  bandwidth.  The  slow  spectral  decay  of  the  triangular  data  window  and 
the  fast  spectral  decay  of  the  cubic  data  window  are  evident.  The  cosine  and 
quadratic  data  windows  exhibit  intermediate  behavior.  The  first  three  side 
lobes  of  the  spectral  windows  are  given  in  Table  2,  where  it  is  seen  that  the 
quadratic  window  offers  an  8.3-dB  improvement  relative  to  the  cosine  window 
in  the  size  of  the  first  side  lobe,  and  the  cubic  window  yields  an  additional 
13. 3-dB  improvement.  . 
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1 .276 


15 


Fig.  4.  Spectral  Window  for  Cosine  Data  Window 


>r  Quadratic  Data  Window 


TafileV 


FIRST  THREE  SIDE  LOBES  OF 
SPECTRAL  WINDOWS 


Data  Window 

Side  Lobes  (dB) 

Triangular 

-26.  5 

-35.7 

-41.6 

Cosine 

-31.5 

-41.  5 

-48.5 

Quadratic 

-39.  8 

-53.5 

-62.4 

Cubic 

-53. 1 

-71.3 

-83.2 

RESULTS 


The  general  expression  for  EDF  is  given  in  Eq.  (14).  We  eliminate  the 
segment  length  L  in  this  expression  in  favor  of  the  half-power  bandwidth  B 
by  using  Eq.  (33)  to  obtain  the  dependence  on  the  fundamental  parameter  BT 
(see  Introduction).  It  follows  that 


P-1  / 

l  .  Ik  \ 

*  (kBT/c-‘) 

2 

E  ( 

uv  P-1  ' 

1  p) 

<f>  (0) 

k— (P-i) 

u 

(36) 


For  a  particular  window  u  and  value  of  BT,  K  is  computed  versus  P.  A 
sample  tabulation  for  the  cosine  data  window  and  BT  =  8  is  given  in  Table  3. 
The  column  headed  "Fractional  Overlap"  is  a  measure  of  how  much  the  individual 
data  windows  overlap  in  the  spectral  processing  technique  depicted  in  Fig.  1  and 
is  given  by 
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Table  3 


EQUIVALENT  DEGREES  OF  FREEDOM 
VERSUS  NUMBER  OF  PIECES. 

BT  =  8,  COSINE  DATA  WINDOW 


p 

K 

F  ractjonal 

Overlap 

2 

4.  00 

i 1  1 

3 

6.  00 

.  • 

4 

».  00 

.  - 

5 

10.  00 

. 

6 

12.  00 

•  <4 

7 

14.  00 

.  - 

8 

15.  96 

9 

17.  74 

10 

19.  l;: 

t  * 

11 

20.  o:; 

12 

20.  5u 

13 

20.  69 

14 

20.  72 

15 

20.71 

16 

2 0.  0*. 

17 

20.  6  1 

18 

2o.  :>• 

19 

2  0.  52 

20 

2 1 1 .  4  * 

30 

.  4  ■* 

40 

50 

100 

300 
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UNCLASSIFIED 


F7G  774 


1.25 


1.4 


1.6 


(37) 


L-S  P-T/L  P-BT/C 
L  =  P-1  =  P-1 


when  non-negative.  Equations  (13)  and  (33)  were  employed  in  Eq.  (37). 


Several  points  in  Table  3  are  worth  noting.  For  no  overlap,  the  EDF 
increases  linearly  with  P ,  and  even  as  overlap  begins  to  occur,  the  rate 
of  increase  of  EDF  remains  the  same.  -Thus,  24%  overlap  still  yields  the 
maximum  possible  EDF  for  P  =  7.  However,  as  P  and  the  overlap  increase 
further,  the  EDF  increases  more  slowly,  eventually  reaching  a  maximum,* 
after  which  it  decreases  slightly  for  further  increases  in  P.  A  point  of 
diminishing  returns  is  reached  somewhere  near  P  =  12,  where  98%  of  the 
maximum  (max)  EDF  is  realized.  The  extra  computational  effort  in  spectral 
analysis  for  P  >  12  is  not  worth  the  return  in  stability.  The  fractional  over¬ 
lap  for  98%  of  max  EDF  is  .59;  the  EDF  is  then  20.5,  whereas  it  was  only 
10. 0  for  the  last  non -overlapped  example.  The  case  for  overlapped  processing 
is  well  demonstrated  by  Table  3. 


Similar  results  for  all  four  windows  and  BT  *  2,  4,  8,  16,  32,  and  64 
are  condensed  in  Table  4,  which  gives  the  required  number  of  pieces  and 
the  corresponding  fractional  overlap  for  a  specified  fraction  of  the  max  EDF. 
For  example,  in  Table  4B  for  the  cosine  data  window,  in  order  to  realize 
a  specified  fraction  of  98%  of  the  max  EDF  at  BT  =  8,  the  number  of  pieces 
required  is  12,  and  the  corresponding  fractional  overlap  is  .59.  The  bottom 
row  of  each  data-window  table  gives  the  max  EDF  for  the  corresponding 
value  of  BT.  Also  provided  is  an  equation  for  max  EDF,  which  was  empiri¬ 
cally  determined  to  fit  through  the  numerical  values  obtained. 


Several  striking  invariances  are  apparent  upon  inspection  of  Table  4.  First, 
for  a  given  record  length  T  and  desired  frequency  resolution  B,  the  max 


•The  existence  of  a  finite  value  of  P  for  maximum  EDF  is  similar  to  the 
situation  cited  in  Reference  11. 
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EDF  is  virtually  independent*  of  the  window  employed;  that  is.  all  the  windows 
considered  have  the  same  variance-reduction  capability  in  spectral  estimation 
when  compared  under  the  same  frequency-resolution  constraint.  A  simple, 
approximate  rule  of  thumb  for  all  four  windows  is  given  by 


max  EDF  «3  (BT  -  1). 


(38) 


Recall  that  B  is  the  half-power  bandwidth  of  the  spectral  window. 


For  a  given  window  and  specified  fraction  of  the  max  EDF,  it  will  be  ob¬ 
served  from  Table  4  that  for  BT  >  4,  the  required  fractional  overlap  is 
approximately  constant  (independent  of  BT).  Therefore,  in  the  last  column  of 
Table  4  is  entered  a  representative  or  "average"  fractional  overlap  required 
for  the  specified  fraction  of  max  EDF  entered  in  the  first  column.  (This  simple 
rule  does  not  hold  well  when  100%  of  the  max  EDF  is  required;  accordingly,  no 
value  is  entered  for  this  case. ) 


For  a  given  value  of  BT,  the  number  of  pieces  required  to  realize  .  99  (or 
less)  of  max  EDF  is  virtually  independent  of  the  particular  window  employed  in 
spectral  analysis.  This  independence  is  very  Important;  it  says  that  all  four 
windows  require  the  same  number  of  FFTs  in  order  to  realize  the  same  EDF 
and  that  selection  among  the  windows,  therefore,  can  not  be  based  upon  the 
number  of  FFTs  required,  but  must  be  based  upon  some  other  consideration 
such  as  side-lobe  level  or  size  of  FFT  (to  be  discussed).  An  approximate  rule 
of  thumb  for  the  number  of  pieces  required  is  given  by 


Number  of  pieces  required  —  1.75  BT  for  .99  of  max  EDF.  (39) 


For  large  BT  products,  the  number  of  pieces  required  to  realize  max  EDF 
is  significantly  larger  than  the  number  required  to  realize  99*  of  max  EDF; 
thus,  this  large  amount  of  additional  processing  yields  insignificant  improve¬ 
ment  and  is  to  be  avoided. 


• Very  small  values  of  BT  are  an  exception;  these  are  of  little  practical 
interest  however. 
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Table  4A.  Triangular  Data  Window 
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Quadratic  Data  Window 


Table  5 

REQUIRED  FRACTIONAL  OVERLAP  FOR  .  99  max  EOF 


Data  Window 

Triangular 

Cosine 

Quadratic 

Cubic 

Fractional 

Overlap 

.56 

.61 

.65 

.70 

The  required  fractional  overlap  la  greater  for  the  better-aide-lobe  win- 
do  wa.  Thua,  for  example,  to  realize  .99  of  max  EDF,  we  have  (virtually  inde¬ 
pendent  of  the  BT  product)  the  valuea  Hated  in  Table  5.  Notice  that  rather 
large  overlapa  are  required  for  aome  windowa. 

The  difficulty  of  realizing  general  fractional  overlapa,  such  a a  .56,  raises 
the  question  as  to  what  fraction  of  max  EDF  is  attainable  if  one  restricts  over¬ 
laps  to  a  few  easily  realized  overlaps  such  as  .50  and  .625.  This  question  is 
answered  in  Tables  6  and  7.  *  Table  6  indicates  that  the  cosine  data  window  at 
507  overlap  (a  popular  case)  yields  92rc  of  the  max  EDF.  However,  the  cubic 
data  window  realizes  only  7 Sft  of  its  potential  at  5<T  overlap.  Table  7  shows 
that  when  the  fractional  overlap  is  increased  to  5/9,  the  cosine  and  quadratic 
data  windows  realize  virtually  their  ultimate  capability.  If  the  overlap  is  in¬ 
creased  to  75  7 ,  the  cubic  data  window  then  realizes  its  max  EDF. 


Table  6 

ATTAINABLE  FRACTION  OF  max  EDF 
AT  .50  FRACTIONAL  OVERLAP 


Data  Window 

Triangular 

Cosine 

Quadratic 

Cubic 

Fraction  of 
max  EDF 

.96 

.92 

.85 

.75 

•These  values  are  not  attainable  from  Table  4  but  come  from  the  complete 
tabular  results,  of  which  Table  3  is  one  example. 
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Table  7 


ATTAINABLE  FRACTION  OF  MAX  EDF 
AT  .625  FRACTIONAL  OVERLAP 


Data  Window 

Cosine 

Quadratic 

Cubic 

Fraction  of 
max  EDF 

1.00 

.98 

.93 

Thus  far,  no  trade-off  has  been  necessary  to  realize  the  better  side  lobes 
of  the  smoother  data  windows;  that  la,  by  proper  choice  of  overlap,  equal  statis¬ 
tical  stability  is  attainable,  and  an  equal  number  of  FFTs  is  required,  for  all 
four  windows.  However,  there  is  one  trade-off  that  enters  as  follows:  if  the 
original  record  length  T  Is  composed  of  samples  of  a  process  at  increments* 
At,  more  samples  are  contained  in  the  segment  length  L  for  the  better-side- 
lobe  windows;  that  is,  the  number  of  samples  in  interval  L  is 


N 

s 


L  C 
At  *  B  At 


(40) 


employing  Eq.  (33).  Ng  is  directly  proportional  to  half-power-bandwidth 
constant  C  for  a  specified  sampling  increment  At  and  resolution  B.  Thus, 
using  Table  1,  the  cosine  data  window  requires  1. 13  times  as  many  samples  as 
the  triangular  data  window  requires.  The  corresponding  ratios  for  the  quad¬ 
ratic  and  cubic  data  windows  are  1.23  and  1.43,  respectively.  Thus,  better 
side  lobes  in  spectral  analysis  can  be  realized  at  the  expense  of  larger-size 
FFTs,  rather  than  at  the  expense  of  statistical  stability  or  number  of  FFTs. 
These  comments  hold  for  equal  half-power  bandwidths  of  the  windows. 

For  a  specified  sampling  increment  At,  desired  resolution  B,  and 
particular  window,  Eq.  (40)  will  generally  not  be  a  power  of  2.  Since  FFTs  run 


•The  sampling  increment  At  must  be  chosen  small  enough  to  avoid  alias¬ 
ing;  this  is  the  only  area  where  the  bandwidth  of  the  process  comes  into  con¬ 
sideration. 
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faster  when  conducted  at  powers  of  2,  It  is  recommended  that  the  desired 
resolution  B  be  changed  somewhat  (increased  or  decreased)  so  as  to  make  Ns 
a  power  of  2.  This  is  generally  a  tolerable  situation  since  B  is  often  a 
"guesstimate"  in  the  first  place. 


OPTIMUM  WEIGHTING  OF  INDIVIDUAL  SPECTRAL  ESTIMATES 

In  Eq.  (3),  the  p-th  estimate  |Yp(f)|2  of  the  power-density  spectrum  was 
weighted  equally  with  all  other  estimates.  In  this  section,  we  consider  whether 
unequal  weighting  will  yield  additional  worthwhile  variance  reduction.  Inas¬ 
much  as  the  edge  pieces  in  Fig.  1  are  weighted  only  once  by  a  data  window, 
whereas  the  Interior  pieces  are  weighted  more  than  once,  perhaps  heavier 
weighting  of  the  edge  pieces  will  yield  additional  stability.  The  power-density 
estimate  is  formed  as 

P 

G<f)«  wp|Yp(f)|2‘  (41) 


The  derivation  of  the  EDF  of  this  estimate  is  given  in  Appendix  C,  which  also 
presents  the  optimization  of  the  EDF  by  choice  of  weights  for  a  given  P, 
record  length,  and  window.  A  summary  of  the  numerical  results  is  given  in 
Table  8,  where  P  is  varied  up  to  84.  The  largest  value  of  EDF  attained  over 
that  range  of  P  is  quoted  in  Table  8,  except  for  BT  *  16  where,  with  the  ex¬ 
ception  of  the  triangular  data  window,  P  *  64  was  not  yet  great  enough  to 
reach  the  max  EDF  by  weighting. 


Table  8 

OPTIMUM  EDF  VALUES 


Data  Window 

BT 

2 

4 

00 

16 

Triangular 

4.37 

10. 12 

21.65 

44.72 

Cosine 

4.50 

10. 15 

21.44 

>43.87 

Quadratic 

4.07 

9.80 

21. 29 

>44.18 

Cubic 

3.82 

9.52 

20.94 

>43.72 

A  comparison  of  Table  8  with  the  max  EDF  values  of  Fig.  4  reveals  that 
very  little  is  to  be  gained  by  optimum  weighting,  except  for  small  values  of  BT. 
However,  small  values  of  BT  are  not  of  great  practical  interest  because  the 
estimates  are  very  unstable  statistically.  Also,  the  number  of  pieces  P 
required  to  realize  the  optimum  EOF  is  rather  large;  for  example,  in  order 
to  gain  an  improvement  in  EDF  of  0. 5  over  the  max  EDF  in  Table  4,  19  pieces 
are  required  for  the  cosine  data  window,  20  pieces  for  the  quadratic  data  win¬ 
dow,  and  15  pieces  for  the  cubic  data  window.  Moreover,  the  optimum  weights 
are  found  to  alternate  in  sign  for  some  cases,  causing  a  loss  in  significance. 


DISCUSSION 

This  investigation  of  four  good  data  windows  indicates  that  there  is  no  best 
window  for  spectral  estimation.  Rather,  there  is  a  trade-off  to  be  made  when 
choosing  a  window;  the  better-side-lobe  windows  require  larger-size  FFTs. 
When  the  proper  overlap  is  used  for  each  data  window,  the  selection  of  win¬ 
dows  can  not  be  made  on  the  basis  of  statistical  stability  or  the  number  of  FFTs 
required. 

The  quadratic  and  cubic  data  windows  are  simpler  and  quicker  to  compute 
than  the  cosine  data  window  in  the  time  domain  (but  not  in  the  frequency  do¬ 
main  as  regards  their  effects).  In  addition,  the  quadratic  and  cubic  windows 
have  better  side-lobe  behavior  and,  therefore,  merit  serious  consideration  for 
spectral  analysis.  However,  they  require  larger-size  (but  not  more)  FFTs 
than  does  the  cosine  data  window. 

The  reason  that  the  better-side-lobe  windows  do  not  require  more  FFTs 
than  do  the  other  windows  is  as  follows.  For  a  fixed  half-power  bandwidth  B, 
the  better-side-lobe  windows  require  larger  segment  lengths  L ;  however,  the 
corresponding  data  windows  tend  to  be  more  peaked  near  the  center  of  the  seg¬ 
ment  length.  In  order  to  utilize  a  given  record  length  for  maximum  statistical 
stability,  these  data  windows  must,  therefore,  overlap  for  a  greater  percentage 
of  the  segment  length.  It  turns  out  that  the  increased  segment  length  and 
increased  overlap  almost  exactly  compensate  each  other,  so  that  a  constant 
number  of  FFTs  is  required  regardless  of  the  window  selection. 

This  report  has  concentrated  on  the  variance  of  the  spectral  estimates.  In 
Appendix  D,  it  is  shown  that  the  covariance  of  spectral  estimates  at  two  differ¬ 
ent  frequencies  is  always  positive  but  is  essentially  zero  when  the  frequencies 
differ  by  more  than  the  width  of  the  spectral  window.  Thus,  spectral  estimates 
at  frequencies  farther  apart  than  B  are  statistically  linearly  independent  of 
each  other. 
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Appendix  A 


DERIVATION  OF  MEAN  AND  VARIANCE 


From  Eqs.  (2)  and  (3),  the  spectral  estimate  at  frequency  f  is 
G(f)  Jf iu  dv  exp  (-i2irf  (u-v))  x(u)x*(v)  • 

p =1  *' 


w[u  -y  -  (p-l)S]  W*[v  -y  -  (P-1)S]  . 


(A-l) 


(For  completeness,  process  x  and  window  w  are  allowed  to  be  complex. ) 
The  mean  value  of  'S'ff)  is  obtained  by  ensemble-averaging  Eq.  (A-l)  over  the 
possible  realizations  of  process  x.  Expressing  the  correlation  of  x  as  a 
Fourier  transform  of  spectrum  G,  we  find  the  average  value 


GW 


w  Cu  -  y  "  (p-i)s3  w*  [v  -  y  -  (p-l)S] 


-/d,  Gf!-*)  |W(v)|J 


(A -2) 


where  we  have  utilized  Eq.  (4).  If  spectral  window  |w|2  is  narrower  than  the 
finest  detail  in  the  true  spectrum  G,  Eq.  (A -2)  becomes 


E|  G(f)l  =  G(f) / dr  |W(^)|J 


(A -3) 


Equation  (A -2)  is  not  limited  to  Gaussian  processes  but  is,  in  fact,  true  for  any 
stationary  process. 


In  order  to  evaluate  the  variance  of  the  spectral  estimate,  we  start  with 
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Ej^2  (f)}  »  ~  Xj  ^^JJJJdudv  dr  ds  exp(-i2*f(u-v+r-s))  Ejx(u)x*(v)x(r)x*(s)[* 

w^-^- (P-I)S-J  w*[v- (p-l)S]  w[r-y-(q-l)S]  w*[s-i-(q-l)S]  . 

(A  -4) 

In  order  to  simplify  this  expression,  we  must  be  able  to  evaluate  the  fourth- 
order  average.  If  x  is  a  real  Gaussian  random  process,  the  average  in  Eq. 
(A-4)  becomes 

R(u-v)  R(r-s)  +  R(u-r)  R(v-s)  +  R(u-s)  R(r-v)  ,  (A-5) 

where  R  is  the  correlation  of  process  x .  (If  x  is  a  complex  envelope  of  a 
Gaussian  process,  the  middle  term  in  Eq.  (A-5)  is  absent  [12].)  When  we 
express  correlation  R  as  a  Fourier  transform  of  spectrum  G  and  substitute 
Eq.  (A-5)  in  Eq.  (A-4),  we  obtain 


w[u-j-  (P-1)S]  w*[v-j-(p-I)S]  w[r-y-(q-l)S]  w*  [s  -  j- (q-l)S]* 

ffdu  dv  G(m)  G(v)  jexp(i2TM(u-v)-H2jr»<(r-s))  +  exp(i2Tn(u-r)+i2jr»'(v-s)) 

+  exp^i2TM(u-s)+i2Tv(r-v))] 


=  ^  Jjfdudv  G(m)  G(v)  [Iw(f-M)|2  |W(f- *»)|2  +  exp(i27r(M+«')(p-q)S)* 


W(f-M)  W*(f+i/)  W(f+M)  W*(f-v)  +exp(i2ir(M-»')(p-q)S)  |w(f-M)|2  |w(f-^)(2]  , 

(A -6) 

using  Eq.  (4).  The  first  term  in  Eq.  (A-6)  is  recognized  from  Eq.  (A-2)  as  the 
square  of  the  mean  of  u.  Therefore, 
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Var{$(f)|  =“^  £  d  /v  G («)  exp(i2xM(p-q)S)  W(f-M)  W(f+M) 

P  p=l  cplV 


P  P  y*  iZ 

— rT'Y'  /dM  G(m)  exp(i2rM(p-q)S)  lw(f-M)l  .(A-7) 


P  p=l  q=l 


Now  if  f,  the  frequency  of  interest,  is  greater  than  the  bandwidth  of  the  spec¬ 
tral  window  |wl2  (i.e. ,  a  couple  of  resolution  cells  away  from  the  origin), 
then  W(f-M)  and  W(f+>*)  do  not  overlap  significantly.  Letting  B  be  the  half- 
power  bandwidth  of  the  spectral  window  |W|2,  we  therefore  have  the  excellent 
approximation*  for  f>B, 


P  P  /• 

Var|6(f)}  S  — 2  SSI  jjdM  G(m)  exp(i2xM(p-q)S)  IW(f-* 


(A-8) 


(For  f  =  0,  the  two  terms  in  Eq.  (A-7)  are  equal  if  data  window  w  is  real,  in 
which  case  Varf&(0)|  is  double  that  given  by  Eq.  (A-8)  at  f  =  0.)  Making  the 
change  of  variable  k  =  p-q  in  Eq.  (A-8),  we  obtain 


Var|G(f)}=-^  £  (l  -  fd»  exp(i2xMkS)  G(m)  |W(f-M)|! 

k=  -(P-1) 


(A-9) 


But  if  the  bandwidth  B  of  spectral  window  |W|2  is  narrower  than  the  finest 
detail  in  spectrum  G,  the  integral  on  m  in  Eq.  (A-9)  can  be  approximated  by 


>/d 


G(f)  /dM  exp(i2w-MkS)  (W(f-M)l ' 


=  G(f)  exp(i2xfkS)  $^(kS)  , 


(A-10) 


where  we  have  utilized  Eqs.  (4)  and  (7).  Then  Eq.  (A-9)  becomes 


*When  x  is  a  complex  envelope,  Eq.  (A-8)  is  exact;  see  comment  under 
Eq.  (A -5). 
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Var|'G'(f)}  =  G2(f) -p-  E  (l-|i(kS)|2  . 

k=-(P-l)  W  1 

This  equation  is  similar  to  that  given  in  [9,  p.  7l]  . 

Equations  (A-3)  and  (A-ll)  are  the  main  products  of  this  appendix. 


(A-ll) 
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Appendix  B 


CORRELATIONS  OF  DATA  WINDOWS 


The  correlation  of  the  normalized  data  window  u  is  given  by  Eq.  (11)  as 


Vt>  */« 


dt  u(t)  u*(t-r)  . 


(B-l) 


This  quantity  is  required  for  the  calculation  of  EDF,  K,  in  Eq.  (14).  Since 
u(t)  =  0  for  iti  >1/2,  then  4>u(t)  =  0  for  m  >  1.  Thus,  for  the: 


Triangular  Data  Window 


2  3  1 

0  (r)  1  '  6r  +  6 m  ,  m  <  - 


<t>  (0)  <3  1 

u  2  (1  -  I r | )  ,  “  <  lr|  <  1 

* 


(B-2) 


Cosine  Data  Window 


*a,r>  2 


[l  +  jcos(2irr)J  sin(2 


7r  l  r|) ,  l t\  <  1 


(B-3) 


Quadratic  Data  Window 


fid-irl^sQ^r),  |  <m<  1 


0u(r) 


V°> 


J£|r,sf 


(B-4) 


I 
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Cubic  Data  Window 


The  forms  for  the  correlation  of  the  quadratic  and  cubic  data  windows  are 
compact  and  very  useful  for  computer  programming. 
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Appendix  C 


OPTIMUM  WEIGHTS  FOR  EDF 


The  estimate  of  the  power-density  spectrum  is  given  by  Eq.  (41).  By 
generalizing  the  results  in  Appendix  A,  we  have 


VarjGtf)} 


■  £  r*\fi 

k=-(P-l)  U 


dn  exp(i2rMkS)  G(m)  |W(f-M)|^ 


(C-l) 


where 


q 


(C-2) 


The  sum  is  over  all  non-zero  terms.  Utilizing  the  same  assumptions  used  in 
Appendix  A,  we  have 


P-1 


Var|G(f)}2G2(f)  £  T  Id .  <kS)|2  . 

k=-(P-l)  '  ' 

EjG(f)|3G(f)^wpew(0)  , 


(C-3) 


(C-4) 


yielding 


K  =  2 


P-1 

£ 

k=-(P-l) 


(C-5) 


£  \\ 


35 


where 


“k 


♦...(0) 


<C-«> 


Partially  differentiating  K  with  respect  to  w*  [l3,  Appendix]  and  noting  that 
the  absolute  scale  of  weights  jwpj  does  not  enter  tn  R,  we  see  that  the 
optimum  weights  must  satisfy  the  equation 

£  M  w  «  1,  l£j*P.  (C-7) 

k  J 


If  we  define  the  matrices 


M 


1  <  m,  n  <  P  , 


1T  »  [l  1 .  . .  l] .  wT  «  [Wi  w2  . . .  wp]  . 


(C-8) 


then  the  optimum  weights  are 


w  *  M"  ^ }  , 


and  the  optimum  EOF  is 


K«2 


(C-9) 


(C-10) 


i 
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App— rill  D 

COVARIANCE  OF  SPECTRAL  ESTIMATES 


A  generalization  of  the  technique  in  Appendix  A  leede  to  the  following 
expreeeion  for  the  covariance  between  spectral  eetl mates  at  frequencies  f  j  and 

f2 


Covf6(f  j),  ^(f2)}  *  “7^  j  J**  G(M)  Wtf^M)  W(f2-*-M)  exp(i2w-M<p-q)S) 


d m  G(m)  W(f1*M)  W*(f2~M)  exp(i2*»4p-q): 


f 


P-D 


Now  if  f^+f2  is  greater  than  the  bandwidth  of  the  spectral  window ,  W(f  ^-m) 
and  W(f2+**)  are  essentially  non-overlapping.  Then, 

P-1 

1) 


covai-  £ 
k—  (P- 


(*  MJ* 


d*  G(m)  W (f  j— m)  W*(f2-M)  exp<i2*»kS) 


^  -P-2) 


This  quantity  is  always  positive.  However,  it  is  very  small  when  |f2-f^|>B 
because  W(f1-«)  and  W(f2-M)  do  not  overlap  than . 

When  true  spectrum  G  varies  but  slightly  over  a  frequency  range  B. 


P-1 


CoviG^IGdj)  £  (l  -T)  Wl  • 
k— (P-1) 


P-3) 
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where 


X(r.f) 


■/ 

-/• 


du  axp(l2rur)  W(m+OW*(m) 
exp  (— L2wft)  w(t)  w*(t-r) 


P-4) 


is  the  ambiguity  function  of  window  w .  Again,  if  |  fi-f2 1  >  B  ,  Eq.  P-3)  is 
essentially  zero,  as  shown  by  the  first  form  in  Eq.  P-4). 


As  an  example,  for  the  cosine  window  and  0%  overlap,  the  covariance 
coefficient  (ratio  of  Cov  to  the  square-root  of  the  product  of  variances)  is 

9  ’  36  ’  °*  °»  •  •  •  •  *or  |*2~*l|  *0,  L’L’L’L’  ’  resPectively* 

2 

Thus  spectral  estimates  —  Hz  apart  are  essentially  uncorrelated.  For  50^ 

L 

overlap  (and  large  P),  the  corresponding  covariance  coefficients  are  slightly 
larger,  being  1,  .495,  .068,  .005,  0 . 
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Estimation  of  Cross- 
Spectra  Via  Overlapped 
Fast  Fourier  T ransform 

Processing 


Albert  H.  Nuttall 

ABSTRACT 

The  optimum  overlap  to  be  used  for  estimation  of 
cross-spectra  via  FFT  processing  of  windowed  data 
is  shown  to  be  identical  to  that  for  estimation 
of  auto-spectra.  In  addition,  a  useful  geometric 
interpretation  of  the  random  errors  in  cross- 
spectral  estimation,  and  their  covariances,  is 
furnished. 
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ESTIMATION  OF  CROSS-SPECTRA  VIA  OVERLAPPED 
FAST  FOURIER  TRANSFORM  PROCESSING 
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INTRODUCTION 

In  a  recent  report  (Ref.  l) ,  the  use  of  overlapped  fast  Fourier 
transform  (FFT)  processing  of  windowed  data  for  estimation  of  auto-spectra 
was  thoroughly  investigated.  It  is  now  desired  to  extend  these  results  to 
estimation  of  cross- spectra.  The  method  of  overlapped  FFT  processing  is 
often  used  for  cross- spectral  and  coherence  estimation  with  good  results 
(see,  for  example,  Ref.  2);  here  we  wish  to  give  analytical  back-up  to  its 
optimality. 


PROBLEM  DEFINITION 

Consider  that  stationary  random  processes  x(t)  and  y(t)  have  been 
observed  for  a  time  interval  of  T  seconds,  0<t-vT.  Let  the  auto-spectra 
of  the  processes  at  frequency  f  be  6**(f)  and  (f), respectively,  and 
let  the  (complex)  cross-spectrum  be  <r,,(f) •  The  method  of  estimating  the 
cross- spectrum  is  discussed  thoroughly  on  pages  2-4  of  Ref.  1,  and  will 
not  be  repeated  here;  the  reader  is  referred  to  that  reference  for  nota¬ 
tion,  related  past  work,  and  qualifications.  We  let  w(t)  denote  the 
fundamental  data  window,  and  S  the  shift  of  each  successive  overlapped 
window,  and  define 

Ktt)  '  *  (1) 

where  ?  is  the  total  number  of  overlapped  segments 
interval.  The  estimate  of  the  cross- spectrum  is* 

J  r  K«1 

where 

X„(^  =  Jdt  f  (-i  2irp$  X  Hr)  X  16 , 

Y*  s  Jdt-  «-*?>  yfc)-  (3) 

(The  continuous  versus  discrete  versions  of  (3)  are  discussed  on  page  4 
of  Ref.  1.) 

The  estimate  in  (2)  is  a  complex  random  variable  (HV)  which  it  is  hoped 
will  approximate  the  true  cross- spectrum  fi^(f)  for  sufficiently  large  T 
and  propqy  choice  of  shift  S  .  The  problem  is  to  evaluate  the  stability  of 
the  IV  (r,3(f),  and  optimize  the  stability  by  choice  of  overlap.  At  the 


fitting  into  the 


(2) 


^Carets  denote  random  variables,  and  integrals  without  limits  are  over 
the  range  of  non- zero  integrand. 
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same  time,  we  wish  to  investigate  the  dependence  of  the  stability  on  the 
fundamental  parameters  such  as  observation  time  T ,  desired  frequency 
resolution  "B  ,  spectra  £w(f),  £^(f),  etc. 

PROBLEM  SOLUTION 

A 

In  Appendix  A,  the  mean  of  RV  GL.(f)  is  determined  to  be 

|w(Ml 

where 

and  we  have  assumed,  with  no  loss  of  generality,  that  *•  I. 

The  exact  relation  (4A)  indicates  that  the  mean  is  equal  to  the  convolu¬ 
tion  of  the  true  spectrum  6*,(f)  with  a  spectral  window  JWtf-)l*.  (Desirable 
aspects  of  windows  are  discussed  on  pages  10-18  of  Ref.  1.)  The  approxi¬ 
mation  (4B)  is  valid  when  the  frequency  width  B  of  spectral  window  ]W(f))1 
is  narrower  than  the  finest  detail  in  the  true  spectrum  £*„(f) .  These 
results  are  not  restricted  to  Gaussian  processes,  cut  hold  for  any  station¬ 
ary  processes  x(t)  and  y(t) . 

We  now  define  the  zero-mean  complex  "RV 

|(f'»  («) 

This  RV  measures  the  deviation  of  the  estimate  of  cross-spectrum  from  its 
true  value.  In  Appendix  A,  the  following  two  relations  are  demonstrated: 

Ell  f (01]  *  (l-  (K5)f ,  (7.) 

w  n  o-f)  |wkC  (7B, 

where 

s£<Hwte*f(fc-T).  (8) 

Three  assumptions  are  required  for  the  validity  of  (7):  the  processes 
x(t)  and  y(t)  are  jointly  Gaussian;  the  frequency  f  of  interest  must  be 
greater  than  bandwidth  B  of  window  IVf?)!*;  and  bandwidth  B  must  be  less 


(u) 

(4B) 

(5) 
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than  the  narrowest  detail  in  and  <c«j(f).  (A  case  where  B 

is  greater  than  the  narrowest  detail  is  discussed  later.) 


A  measure  of  the  stability  of  a  W  is  afforded  by  its  equivalent 
number  of  degrees  of  freedom  (EDF) ;  see  Ref.  3,  p.  22.  For  a  complex  W  z, 
we  extend  the  definition  to 


Et>F  *  2 


E[U-  el^n 


W 


The  denominator  of  (9)  could  be  interpreted  as  the  variance  of  complex  "RV  z. 
Interpreting  z.  as  ^(f),  and  using  (4B),  (6),  (7A),  (8),  and  Parseval's 
Theorem,  there  follows  for  the  EDF  at  frequency  f, 


ET>F  =  K, 


(10) 


where 


The  quantity  tf„„(f)  is  the  complex  coherence  at  frequency  f  of  processes 
x(t)  and  y(t).  Equation  (10)  indicates  that  the  EDF  at  frequency  f  of  RV 
6„u(f)  is  given  by  the  product  of  two  factors,  one  frequency-dependent 
solely  on  the  processes'  spectra  (over  which  we  have  no  control*) ,  and  the 
other  depending  solely  on  the  method  of  processing,  but  being  frequency- 
independent.  Specifically,  K  depends  on  the  number  of  pieces  P  in  the 
average  (2),  the  shift  5  of  each  window  in  (l),  and  the  autocorrelation 
4>W(T)  of  the  window  w(t).  Furthermore,  this  factor  K  is  precisely  the 
sane  quantity  encountered  in  Ref.  1  as  the  EDF  for  auto-spectral  estima¬ 
tion.  Therefore  all  the  results  of  Ref.  1  on  maximization  of  K.  by  choice 
of  shift  S  are  immediately  brought  to  bear  on  the  present  problem  of 
cross-spectral  estimation.  Thus,  the  optimum  choice  of  overlap  for  cross- 
spectral  estimation  is  identical  to  that  for  auto-spectral  estimation.** 


*Linear  filtering  of  x(t)  and/or  y(t),  such  as  pre-whitening,  would 
not  affect  iY.«,(f)|x  ,  and  therefore  not  affect  EDF.  A  related 

observation  on  this  aspect  is  made  in  Ref.  A,  p.  379. 

**More  generally,  since  all  the  variances  of  the  estimates  in  the  following 
sections  depend  inversely  on  K  ,  maximization  of  K  is  appropriate, 
regardless  of  the  particular  definition  of  stability. 
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VARIANCES  OF  QUADRATURE  COMPONENTS 
OF  CROSS-SPECTRUM  ESTIMATE 


The  zero-mean  complex  "RV  $  (f)  in  (6)  is  the  random  error  in 
estimation  of  the  cross-spectrum.  The  diagram  in  Fig.  1  depicts  the 


Fig.  1.  Complex  Random  Variables  in 
Cross-Spectrum  Estimation 


relationships  between  the  various  complex  RYs  .  Here 

"Rnjl-P  *  ^3) 

is  the  true  phase  of  the  cross-spectrum. 

It  is  convenient  to  represent  complex  T?V  gif'*  in  terms  of  its 
real  and  imaginary  components, 


as  shown  in  Fig.  2.  Also  depicted  are  the  projections  of 


(14) 


1 


(f) 


on  a 


Fig.  2. 


Projections  of  Random  Error 


J 


(f 


N 

/ 
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different  rectangular  coordinate  system' (1(f) ,b(f) )  aligned  with  the 
direction  of  the  true  phase  T^(f)  of  the  cross-spectrum.  That  is,  we 
also  represent 


where  a(f)  and  b(f)  are  real.  We  note  immediately  that  all  the  "RVs  in 
(14)  and  (15)  have  zero  mean;  this  follows  from  the  definition  (6) . 

In  order  to  evaluate  the  covariances  of  these  various  quadrature 
components,  we  first  note  from  (7),  (12),  and  the  fact  that  4>wfo)  has 
been  assumed  unity,  that 


Etlftfll]  =  2ffv,(f)S3S(f)/K, 


(16A) 

(16B) 


Equation  (16A)  (or  (7A))  affords  the  interesting  interpretation  that  the 
average  squared- length  of  the  random  error  §(f)  in  the  estimate  of  the 
cross-spectrum  is,  in  fact,  independent  of  tne  true  cross- spectrum,  but 
depends  on  the  auto-spectra  of  the  two  processes  involved.  (See  also 
(A15)  more  generally.) 

Substituting  (14)  in  (16),  there  immediately  follows 

E[  p]  =  ±  /k 

--  [  ft.  B  tf)  ±  fc'{  ft,,  (f )}  ?  -Li  Ml]  /k 


-  (f)|j  i  "Re  tf 5Q  / K ,  l'17A'> 

=  (i7B) 

The  quantities  in  (17)  are  the  covariances  of  the  real  and  imaginary  parts 
of  the  cross-spectrum  estimate;  that  is,  using  (6)  and  (14), 


v«r  *  E 
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Equations  (17)  and  (18)  are  basically  identical  to  Ref.  4,  p.  378; 
however,  the  scale  factor  K  is  different. 

The  projections  a(f)  and  £>(f)  have  simpler  properties  than^(f)  and 
^(f) .  From  (15),  (16),  and  (13),  there  follows 

E{  H  *  ^ s*>w  1 1  e'^‘ ^ /K 

e{&(PaU&]  *  0.  (19) 

Thus  the  projections  of  the  random  error  along  and  perpendicular  to  the 
direction  of  the  true  phase  of  the  cross- spectrum  are  uncorrelated. 
Furthermore,  the  variance  of  the  projection  a(f)  along  the  direction  of 
the  true  phase  is  always  greater  than  or  equal  to  the  variance  of  the 
projection  *b(f)  perpendicular  to  the  true  phase.  In  fact,  if  the  magnitude- 
squared  coherence  is  unity  at  some  frequency  f,  ,  then  the  variance  of  b(f,  ) 
is  zero;  in  this  case,  all  random  fluctuations  of  (x„.(f|  )  lie  along  the 
line  with  phase  PX(J(v  in  Fig.  1. 

On  the  other  hand,  if  the  magnitude- squared  coherence  is  zero  at  some 
frequency  f 2 ,  then  the  variances  of  a(ft)  and  b(ft)  are  equal;  in  this 
case,  the  "scatter"  of  random  perturbations  g(ft)  in  Fig.  1  is  a  circle 
centered  at  the  origin. 

Generally,  the  scatter  of  random  perturbations  is  like  an  ellipse, 
as  depicted  in  Fig.  3,  where  the  major  axis  of  the  ellipse  lies  on  the  line 


with  phase  'R.^(f).  If  Xii  2l( f )  and  t;(f)  are  Gaussian,  as  they  would  be 

(approximately)  if  K  is  large,  then  the  elliptical  diagram  can  be  made 
quantitative  and  interpreted  as  contours  of  iso-probability. 
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VARIANCES  OF  AMPLITUDE  AND  PHASE  ESTIMATES 

Estimates  of  the  amplitude  and  phase  of  the  cross- spectrum  are 
available  according  to 

(20) 

In  order  to  estimate  the  means  and  variances  of  A^,(f)  and  PtJ(  f ) ,  ue 
will  assume  that  the  scatter  of  points  in  Fig.  3  is  small  in  comparison 
with  the  distance  out  to  the  center  of  the  ellipse.  That  is,  using  (9' 
and  (10),  we  will  assume  that 

''2i> 

This  requires  that  the  product  of  observation  tine  and  desired  frequency 
resolution  be  much  larger  than  unity  (Ref.  1),  but  it  also  requires  that 
the  magnitude- squared  coherence  not  be  too  small  at  the  frequency  of 
interest. 

We  first  utilize  (20),  (6),  and  (15)  to  express 

+  aW  *■  i  Wj]  .  (22) 

Then 


Ax^Cf)  *  |  ♦  W  +  i  t  W| 

*  1  4  • 

P^  If''  =  ?»y(f)  4  4  5  W 

:  p  lf\+  _ML  . 


(23A) 


(24B) 


Equations  (23A)  and  (24A)  are  actually  exact,  whereas  (235)  and  (245) 
require  the  assumption  of  (21).  Combining  (233),  (243),  and  (19),  there 
follows  immediately 


V«r ■  [C„W6„W+|6,»)|*(1/K 

=  0. 


(25A) 


(253) 

(25C) 
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Thus  the  amplitude  and  phase  estimates  are  uncorrelated.  The  variance 
of  the  phase  estimate  in  (25B)  is  much  smaller  than  unity,  when  we  recall 
that  assumption  (21)  is  necessary  for  (25)  to  be  true;  that  is,  the  denom¬ 
inator  of  (25B)  must  always  be  large.  It  should  also  be  noticed  that  none 
of  the  covariances  in  e^s.  (19)  or  (25)  depend  on  the  actual  phase  of  the 
cross- spectrum,  but  only  on  its  magnitude.  Some  additional  relations  on 
the  covariances  of  estimates  of  coherence  are  given  in  Ref.  4,  pp.  378-9; 
of  course,  the  phase  estimates  of  complex  coherence  and  cross- spectrum  are 
identical. 

EFFECT  OF  CLOSELY  SPACIE  TONES 


All  the  earlier  results  have  presumed  that  the  bandwidth  B  of  the 
spectral  window  is  narrower  than  the  finest  detail  ir.  the  spectra 

{rw(f),  S»j(f5,  and  £,(f).  We  now  consider  a  case  where  this  is  net  so, 
and  investigate  the  variance  of  the  cross-spectrum  estimate. 


Suppose  the  spectra  are  approximately  pure  tones: 

cyp  *+P,[lKW(f ;*6: 


Then  from  (A27)  in  the  appendix, 


et'JW}  5  * ^  M I, 


.27' 


* 

which  can  be  interpreted  as  the  variance  of  the  complex  RV  6rn.f)  .  New 
if  and  if  the  frequency  f  of  interest  lies  near  or  between  f,  and 

f«,  then  the  window  functions  in  (27)  are  near  their  peak  value  Wlo)  .  Also, 
if  If, -f,l  <  (2PSr'  -  (2T)*',  then  the  bracketed  term  in  .2”  is  near  unity. 
Then  the  variance  in  the  cross-spectrum  estimate  is  large;  in  fact,  it  has 
the  same  value  as  for  V*  I  ,  no  averaging.  Yet  the  true  cross- spectrum  say 
in  fact  be  zero.  Thus  estimation  of  the  cross- spectrum  will  be  in  error, 
even  for  a  large  72  product,  in  a  frequency  range  near  f,  and  f, .  It  should 
be  noted  that  this  noisy  estimation  case  requires  the  frequency  separation 
of  the  tones  to  be  less  than  (27)*’,  not  (2L)  ' ;  thus  the  tone  separation  must 
be  much  closer  than  the  fundamental  resolution  of  B  *  L ' . 


If  the  tone  separation,  on  the  other  hand,  satisfies  |f,-f^l>(PS'  =7*, 

then  the  variance  of  estimation  is  greatly  reduced,  as  inspection  of  the 
bracketed  term  in  (27)  indicates.  In  fact,  (27)  becomes 


Eflj’W}  5  t VK'f  5vf.- v 57’, if  vy,  <  <*s 


which  has  the  desirable  T'1  dependence  cr.  the  number  of  pieces  ir. 
average  (2).  Gf  course,  when  \fx  - f,)  >  B  ,  then  the  window  function 
decay  rapidly  and  indicate  a  greatly  reduced  variance,  ever,  if  B 
than  the  finest  detail  in  the  spectra. 


.3' 


the 

s  in  2” 
is  greater 
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APPENDIX  A.  DERIVATION  OF  MCKENTS 

From  (2), 

(AD 

But  from  (3), 

E’fs.PY*^  *  If  «,  (A2> 

where 

^(r)  =  E[x«Hvj*(fc-4  (A3) 

is  the  cross- correlation  of  x(t)  and  y(t) .  We  are  allowing  ail  processes 
and  windows  to  be  complex,  for  generality,  and  have  utilised  j oint-station- 
arity  in  (A3).  The  cross- spectrum  of  x(t)  and  y(t)  is 

^  2»rf  fr)#  (A4) 

Utilizing  (A4)  and  (5)  in  (A2),  there  follows  for  (Al), 

*e  have  also  employed  the  fact  that 

"W*  If)  -  (y 


which  fellows  from  {!'.  Equation  (A5)  is  the  fundamental  relation  for 
the  mean  of  the  cross-spectral  estimate.  However,  when  ^  is  less  than 
the  finest  detail  in  £,,(f), 

-  <s#>.  (*■: 

upon  setting  I, without  loss  of  generality. 

To  evaluate  the  variance  of  €  (f),  the  following  steps  are  required; 

from  (2),  ^ 

v 

-  -jj  JL  EfxWVPJlWY.If^.  (as) 

But  the  statistical  average  in  (A8N  is,  using  13), 

=  :j!.r  2^  ffj'KA') 

•  (aq' 
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Further,  the  statistical  average  in  (A9)  is  given  by 

eUI *  M,)  -4-X  M*>, 

where  we  have  assumed  that  x(t)  and  y(t)  are  joint  Gaussian,  and  have 
defined  _  e  . , 

X.trt  •  Et«»»  (t-Tl), 

*atrU 

(R^frl  »  £  ?  i  H  3  (t-rij  ^ 


(A10) 


;a:i: 


' The  last  function  in  (All)  is  generally  different  from  [k}'  Denoting 
the  Fourier  transforms  of  the  three  functions  in  .All!1  by  fi\,(f',  Q  \  f  , 
A,„(f)  respectively,  in  a  manner  similar  to  (A4),  (A1C)  becomes  * 

EX***!  =  *>^-1 VAA)]  £\*  (v) 

♦“  2*?*  K,-*C3  «p[-i  ^(4i--0]  M 

*  **?  D  **  |pw[-i2»»  ff,-+5  jf*  (y^  A1Z 

£»»  f  and  Stj'f'  must  always  be  real.)  Substituting  (AH'  ir.  ,A°  ,  there 
follows 


*  W  l  ff,-  -  I  2-r>> 

+  ff»  i>*>  w  2-(M  (f  - .  :-v  fi -o] 

+  ^  evf.  J  (4-A)  -  ,  7wv  ■>,  -+i)j  j 


s3V  M 

■v  W*  (f If-v1  (y  fvf  -  y 

-*■  i»j  If  ?)\*  (t  -»>  K  ?  * 
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+  W  Sffv)]  «vft  2wS{t-*)[y-»}\ 

+  Jbt^r)  J?*  (*)  V\l  if  -  y)  \?$+  »)  W  (*~  *) « » f[\  2*5  (*-»')  (^ ♦  »»3 *] 


+  dp  (r^  !^)V/(f-^|  t»p|j  ^(k-w^Sj^  c)p  Sjj(p);W{f-»^j  fxpT-i3»r  ^ 

+  Ay  "^)yj!^*>*)tJif  j  >ir(k-»)p5)  c<Jv^^  [v)  (f  -  ►*)  ^/  i*f  *p)  P*yj  Vw  .k-^v  S  ,  ( A1  3) 


.'per.  substitution  of  (AI3)  in  (A8),  we  obtain 


l 


t \  \  ^  (P 'l  *  ,  [  Jk  ®«5  W ',  Vv  (f  -/) | 

+  «r/,  >W (K-«V S  ((y  exp-.Vik-^Sj 

K  i  [■  '  1  *•  **  jj 

+  dv  ^  ff  VVV  !f+  *•}  *>Tj  >(*- dv» (v^W^^V 

=;e!s<  * 


(AM) 


where  we  have  used  vc  .  If  the  frequency  f  of  interest  is  greater  than  the 
bandwidth  B  cf  the  window  !Vrf(P]*,  then  W(  f-p '  and  W I  f— /■!'  do  not  overlap  on 
the  t*- scale.  Then 

EfyW]  *  ^rI|L{^  )^5] 

.  r<^  -y)]1  top  r-iV»(*-"V  5j 

=  «»p(t  IV^s'' 

•  e^f-VK,s\  , a:c' 

This  is  a  general  relation  for  sty'Wft  ;  it  Wi  11  be  noticed  to  be  independent 
cf  cross- spectrum  6r„^(  f ' ,  and  depend  only  on  auto-spectra  fir„(f''  and  f '  . 
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Also,  there  la  do  need  to  knov  j2)_.(f) . 

TJ 

If  B  la  leaa  than  the  narrowest  detail  in  £«■(*'  and  near  the 

frequency  f  of  Intereat,  the  Integral  on  ^  in  ( A1 5 '  becomes  approximately 

|jr%(^  tvp(i  km  (k  S)(  vAl6' 


and  . A1 5  yields 

*  s.r -i- r) ; ♦.:«£!’ .  -- 

Since  £,lf'  is  a  complex  KV,  it  is  necessary  a.sc  tc  evaluate  the 
quantity  Ejf' ,  in  addition  tc  .AS  ,  in  order  to  oonp.ete  the  second- 
order  moment^.  Sue  tc  the  similarity  tc  the  derivations  above,  the  steps 
will  be  presented  in  a  more  ourscry  fashion. 

e?  ^  Ji  BlTfcWY:spT.^r;^  ( Ai8) 


^ *  LTj  ^  ^ Oj  'S.  ^ v  S  *4, 

•  E  j  x  ^  ^  x  'h^  **  W'l 


(A1Q) 


^Uj«}  =  +  [±x- C;  +  t,-^ 


«  /  di>^  kr^  (j*)  ex ^ i  >rp(V  tyj  •  *) 

•V  e«rp['>rH 

+  exf  fi'VwyH’,-^  .  T\, 


[>  <•{*,, y  MK  (f.  e) 

+ M  K  '<C  »t*u; 


(A20) 


»yi)1  v. f  - y  w'}-*' 


4-  T«,  y  2 - mn,\N*  y'A  ;„f  r  >»  t. 

w  *■  ^ 
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+ .  dw  (f  txf [i  "i*  i<-  [ &»  (*»)  W*  (f  -  p)  W*ff^  p)  enp[-  i  ( k-*.}  »  5] 

+ „  V  (f  -  pV  «*p[i  VvW--Ysj  1  ^(p) !  W (f -»)f  exf  [-  i  vIic-mVs],  ( A2  i  ) 


E  {  ^  (P)  =  dw  S1^ I/* S' ^-p)l  "j 
«^L  jj  dp  ji„  (f  -,h)Vv  tf  ♦^♦x^iViX.^sj'  (jp^*  (p)W*  (^-r)Vj*tf*pVx^[-i  ^x-*)pSj 

^ -vYjVv s]„ <Jp  fti,W!w(f-p)'a <xf[- ;'W*-«V  5$ 


=LE^Kr +  eM- 


(A22) 


If  f  is  greater  than  B  , 

E?  J*(P} 

.  j 4p  £•,,(»  1^1  [-a^-^vs] 


('  ■  P1) j  V  Si  [i W s) 

•  dp  ^[-,2-HcpSy, 


(A23) 


This  genera^,  relation  for  e{  j'K)  depends  only  on  the  cross- spectrum  <rM(f), 
and  not  on  the  auto-spectra  £w(f)  and  fiu(f' .  Also,  there  is  no  need  to 
know  J^(f)  or  J^lf)  • 

If  B  is  less  the  narrovest  detail  in  6r«, ( f )  near  the  frequency  f  cf 
interest,  the  Integral  •*  p  in  (A23'  becomes  approximately 

(x.5^exf(,2^kS)  <t*(V5),  (A24) 


and  ?A23'1  yields 


e  ^  y  (p)  ? 


rs  1 

X,j 


-jaV*  \sV 

P . v  i 


r»-p*i 


(A25) 
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For  the  case  where  x(t)  and  y(t)  contain  pure  tones  (eq.  (26)  of 
text),  the  general  relation  (A15)  for  Ef) takes  on  the  following  form: 
first  the  integral  e*  in  (A15)  is  approximately 

*2*^ S),  (A26) 

Then  (A15)  becomes  ( 

*■  ] ,  f*27' 

using  Hef.  5,  (418)  and  (428) j  this  relation  is  used  in  (27). 
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Processing 


Albert  H.  Nuttall 


ABSTRACT 

The  time-limited  nonnegative  data  windows  that 
minimize  the  bias  in  auto-  and  cross-spectral  esti¬ 
mation  of  stationary  random  processes  by  means  of 
overlapped  Fast  Fourier  Transform  (FFT)  processing 
are  derived  for  a  variety  of  constraints.  When  the 
time  duration  L  of  the  data  window  is  constrained, 
the  optimum  data  window  is  ( 2/L)  1/2  cos  (irt/L), 

| t |  <_  L/2 ;  when  the  equivalent-noise  bandwidth  is 
constrained,  the  optimum  data  window  is  (8/3L)1/2 
cos2  { irt/L ) ,  which  is  the  Hanning  window;  when  the 
half-power  bandwidth  is  constrained,  the  optimum 
data  window  is  L"1'2  [1.682  +  4.261  cos  (4.434  t/t)- 
4.337  cos  (3.552  t/L) ) ,  which  is  very  similar  to 
the  Hanning  window;  and  when  the  root-mean-sauare 
bandwidth  is  constrained,  the  optimum  data  window 
is  4/(5L)*'2  cos3  (ut/L).  In  the  three  bandwidth- 
constrained  cases,  the  window  duration  L  is  adjusted 
to  meet  the  constraint. 

The  Hanning  window  is  a  reasonable  compromise  for 
achieving  minimum  bias,  because  in  addition  to  being 
the  optimum  for  one  bandwidth  constaint,  it  is  very 
close  to  the  optima  for  two  other  bandwidth  con¬ 
straints.  The  relative  merits  of  the  spectral 
characteristics  of  the  windows  are  also  discussed. 
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MINIMUM- BIAS  WINDOWS  FOR 
SPECTRAL  ESTIMATION  BY  MEANS  OF 
OVERLAPPED  FAST  FOURIER  TRANSFORM  PROCESSING 


INTRODUCTION 


The  selection  of  good  data  windows  in  spectral  estimation  of  stationary 
random  processes,  to  minimize  leakage,  is  an  important  consideration  and  has 
received  much  attention  [1-8].  In  [6],  a  thorough  investigation  of  fourgood  data 
windows  revealed  virtually  the  same  variance-reduction  capabilities  of  over¬ 
lapped  Fast  Fourier  Transform  (FFT)  processing  when  the  proper  overlap  was 
used  for  each  window.  The  ultimate  variance  reduction  of  this  direct  procedure 
was  also  demonstrated  to  be  identical  to  that  attained  by  the  older  (indirect) 
analysis  procedure  in  [4]. 


In  this  report,  attention  is  focused  on  the  bias  in  the  estimation  of  power 
density  spectra  by  means  of  overlapped  FFT  processing.  Specifically,  the  bias 
is  minimized  by  the  choice  of  data  windows  that  are  restricted  to  be  time-limited 
and  nonnegative  and  are  subject  to  either  a  time-duration  constraint  or  a  band¬ 
width  constraint.  These  results  complement  and  extend  those  of  [8]  for  the  in¬ 
direct  approach  to  spectral  estimation. 


PROBLEM  DEFINITION 


The  overlapped  FFT  method  for  spectral  estimation  and  the  reasons  for  its 
use  are  documented  in  [6].  The  mean  of  the  spectral  estimate  is  given  by  [6, 
eq.  (5)]  for  auto- spectral  estimation,  and  by  [7,  eq.  (4A)]  for  cross-spectral 
estimation.  In  both  cases,  the  mean  takes  the  form* 

E  {6(f)}  =*  /dv  G(f-v)  |  W(v)|2  ,  (1) 


where  G(f)  is  the  estimate  of  the  true  (auto  or  cross)  spectrum  G(f),  and 


W(f)  -  fdt  exp(-i2irft)  w(t), 


(2) 


•Integrals  without  limits  are  over  the  range  of  nonzero  integrand. 
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where  w(t)  is  the  data  window  multiplied  in  the  time  domain  by  the  available 
data.  It  is  assumed  that  the  data  window  is  time-limited  and  nonnegative: 


w(t) 


=  0  for  Itl  >  L/2  | 
*  0  for  Itl  <  L/2  I’ 


(3) 


where  L  is  the  window  duration.  The  restriction  to  nonnegative  data  windows 
guarantees  that  the  spectral  window  W(f)  peaks  at  the  origin.  It  should  be  noted 
that  (1)  is  true  with  no  restriction  on  the  available  record  length  T  and  with  no 
restriction  on  the  statistics  of  the  random  processes  involved,  except  that  the 
processes  must  be  stationary;  they  need  not  be  Gaussian  for  (1)  to  apply. 

The  desired  value  of  (1)  is  the  true  value  G(f);  therefore  the  bias  in  estima¬ 
tion  is  defined  as 


B(f)  =  E  { G(f )  j  -  G(f).  (4) 

We  approximate  this  bias  by  expanding  G(f-v)  in  (1)  according  to 

G(f-v)  S  G(f)  -vG'(f)  +  1/2  G"(f)v2  -  1/6  G’"(f)v3  +  1/24  G””(f)v4,  (5) 

where  the  prime  denotes  a  derivative.  Substitution  of  (5)  and  (1)  into  (4)  yields 
B(f)  =  1/2  G"(f) Jdv  v2  JW(v)  |2  +  1/24  G’"'[f)fdv  v4|W(v)|  2,  (6) 

where  we  have  assumed  (without  loss  of  generality)  that 

/dv|W(v)|2  =/dt  w2(t)  =  1,  (7) 


and  that  |W(v)|^is  even  about  the  origin;  that  is,  w(t)  is  a  unit-energy  real 
waveform . 

We  express  (6)  as 

B(f)  S  1/2  G"(f)  Dx  +  1/24  G’"'(f)  D2  s  Bi  (f)  +  B2  (f),  (8) 

where  window  constants 
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Dx  =  /dvv2  |W(v)|2, 
D2  =/dv  v4  j  W(v)  |2, 


(9) 
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are  independent  of  the  true  spectrum  G(f).  To  minimize  the  bias,  we  must 
therefore  minimize  Dj  and/or  D2.  subject  to  (3)  and  (7).  To  this  aim,  it  is 
useful  to  express  (9)  in  terms  of  the  time  domain.  There  follows,  by  use  of  (2), 

E>!  *  (2wr2  fdt  [w-(t)]2  (10) 

and 

D2  =  (2*r4/dt  [w-'(t)]2.  (11) 

Strictly,  the  approximation  (8)  to  the  bias  is  due  only  to  local  variations  in 
true  spectrum  G(f)  about  the  frequency  point  f  of  interest;  equation  (5)  is  not 
necessarily  a  good  approximation  for  larger  v.  Thus,  peaks  in  the  true  spectrum 
that  are  distant  from  the  point  f  under  investigation  are  not  accounted  for  by  (5). 
To  minimize  the  effects  of  remote  spectral  peaks  on  bias,  we  must  also  require 
that  the  spectral  window  W(f)  decay  sufficiently  rapidly  for  large  |f|.  Thus 
the  results  of  the  following  optimizations  are  not  final,  but  must  be  investigated 
to  see  if  they  also  meet  the  requirement  of  sufficiently  rapid  decay  with  frequency. 

In  addition  to  constraints  (3)  and  (7),  we  shall  be  interested  in  constraining 
the  bandwidth  of  the  window;  this  is  in  keeping  with  the  philosophy  of  requesting 
a  specified  frequency  resolution  for  spectral  estimation,  and  letting  the  window 
duration  L  and  overlap  be  whatever  is  necessary  to  meet  this  requirement  {6]. 

It  should  also  be  noted  that  constraints  on  bandwidth  tend  to  equalize  the 
variance- reduction  capabilities  of  the  windows.  This  may  be  seen  from  [6  , 
eq.  (22)],  where  the  equivalent  number  of  degrees  of  freedom  is  given  approxi¬ 
mately  by 


2TBst  for  T»L  ,  (12) 

where  Bst  is  the  statistical  bandwidth  [9,  p.  278]  of  the  window.  Thus,  if  all 
windows  were  constrained  to  have  the  same  statistical  bandwidth,  they  would  all 
have  the  same  variance-reduction  capabilities,  and  we  could  minimize  the  bias 
subject  to  this  constraint.  However,  this  constraint  is  not  mathematically  trac¬ 
table.  *  Therefore,  we  resort  toconstraints  on  other,  more  tractable,  bandwidth 
measures,  with  confidence  that  they  too  will  yield  comparable  variance  in  spec¬ 
tral  estimation  (see  {6,  table  1]). 


*We  have  not  been  able  to  express  the  time-domain  constraint  (3)  directly  in  the 
frequency  domain,  nor  have  we  been  able  to  express  the  requirement  that 
♦w(t)  be  a  legal  correlation  function  directly  in  the  time  domain;  see  [6, 
eqs.  (7)  and  (17)-(21)]. 


3 


TR  4513 


PROBLEM  SOLUTION 

Four  different  constrained  problems  will  be  addressed  in  this  section:  con¬ 
strained  window  duration  L  ;  constrainedequivalent-noise  bandwidth;  constrained 
haJ  f- power  band  width;  and  constrained  root-mean-square  ( rms  (bandwidth.  The  win¬ 
dow  duration  L  is  adjusted  to  meet  the  bandwidth  constraint  in  the  latter  three  cases. 

DURATION  CONSTRAINT 


Here  we  wish  to  minimize  Di  in  (10),  subject  to  constraints  (3)  and  (7)  and 
a  fixed  value  of  window  duration  L.  In  order  that  (10)  be  finite,  wit)  must  be 
continuous ;  therefore  w(tL/2)  =  0  from  (3).  When  we  use  a  calculus- of- variations 
approach,  the  optimum  window  wQ(t)  must  satisfy  the  differential  equation 

Wg(t)  +  X  w0(t)  =  0.  |t |  <  L/2,  (13) 


where  X  is  a  constant  (Lagrange  multiplier).  The  solution  of  (13)  that  satisfies 
the  boundary  conditions  and  (7),  and  has  minimum  is 

/  o  \  1/2 

wQ(t)  *  (yJ  cos(*t/L),  |t|  S  L/2.  (14) 


The  corresponding  value  of  (10)  is 


(15) 


Several  windows  are  compared  in  table  1.  It  is  seen  that  the  Hanning  window 
has  33  percent  greater  bias,  as  measured  by  B^f),  than  the  optimum  window, 
under  a  duration  constraint. 


Table  1.  Window  Bias  Constants 


Data  Window 

4L2Dj 

Optimum,  (14) 

Parabola 

Triangle 

Hanning 

1 

10/w2  =  1.01 
12/ir2  =  1.22 

4/3  =  1.33 
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The  spectral  window  corresponding  to  the  optimum  data  window,  (14),  is 

W  (t)  »-  (2L)1/2  c?Styi  .  (16) 

°  *  l-4L2f2 

The  decay  for  large  frequencies  is  only  as  f  .  The  sidelobes  of  this  and  the 
following  windows  will  be  discussed  later. 

EQ I  I Y  A  L  E N T-  N  OIS  E -  B A  N  DVV  IDT  H  C  ONSTR  A  IN T 

The  equivalent-noise  bandwidth  Be  of  spectral  window  W(f)  is  defined  as 

^.■felamll.— jl—  .  (I?) 

|W(0)|2  [/dt  w(t)]2 

where  we  have  used  (7)  and  (2).  The  quantity  Bg  can  be  interpreted  physically 
as  the  bandwidth  of  an  ideal  rectangular  filter  that  would  pass  the  same  amount 
of  power  as  a  filter  W(f) ,  when  subjected  to  white  noise;  see  figure  1.  The  peak 
of  |\V(f)|“  occurs  at  the  origin,  since  data  window  w(t)  is  nonnegative. 


Figure  1.  Equivalent- Noise- Bandwidth  Interpretation 


The  problem  here  is  to  minimize  Di  in  (10),  subject  to  constraints  (3),  (7), 
and  (17).  This  problem  is  solved  in  appendix  A,  with  the  result  that  the  optimum 
data  w  indow  is 


wQ(t)  = 


cos2(wt/L), 


|t|  S  L/2  . 


(18) 
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This  is  the  familiar  Hanning  window.  The  window  duration  L  must  be  chosen  as 

L-fi. 

according  to  (17).  The  minimum  value  of  Dj  is 

1  4  2 

Dl~  =“  Be.  (20) 

3L2  27 

The  specified  equivalent-noise  bandwidth  dictates  the  window  duration  L  and 
the  minimum  attainable  bias  constant  D^.  The  spectral  window  corresponding 
t°  <18)  is  l/2 

Wo(f) .  /JkA 

\  3  /  irLf(l-LZr) 


(t) 


where  L  must  be  determined  from  (19).  The  decay  for  large  frequencies  varies 

as  f  “3, 

HALF- POWER- BANDWIDTH  CONSTRAINT 

The  half-power  bandwidth  Bjj  of  spectral  window  W(f)  is  defined  as 

|w(±Bh/2)| 2  1 


We  desire  to  minimize  Dj  in  (10),  subject  to  constraints  (3),  (7),  and  (22). 
Converting  (22)  into  the  time  domain  and  restricting  w(t)  to  be  even,  *  constraint 
(22)  takes  a  desirable  integr?'  form: 

Jdt  w(t)[cos(trBj1t)  -  2_1,/2J  =  0  .  (23) 

The  solution  to  this  minimization  problem  is  presented  in  appendix  B.  The  optimum 
data  window  is 

w0(t)  =  L*l/2  Tl.682  +  4.261  cos(4. 434t/L) 

-  4.337  cos(3.  552t/L)],  |t|  S  L/2  .  (24) 


*An  odd  component  in  w(t)  increases  the  rate  of  variation  and  therefore 
increases  D1. 


6 


TR  ,513 


The  window  duration  L  must  be  chosen  as 


L  = 


1.411 

®h 


according  to  (23).  The  minimum  value  of  is 

Dj  =  0. 1604  . 


(25) 


(26) 


The  spectral  window  corresponding  to  (24)  is  obtained  by  employing  (2);  this  will 
be  discussed  in  the  next  section.  It  is  shown  in  appendix  B  that  wJj(tL/2)  =  0; 
therefore  the  decay  of  the  spectral  window  is  according  to  f”3. 

For  comparison  if  the  time  duration  of  the  Hanning  window  is  adjusted  to 
realize  the  specified  half-power  bandwidth,  namely  L  =  1.441/Bh  [6,  eq.  (33) 
and  table  1  ] ,  it  follow-s  that  Dj  =  0.1606  Bfj .  Thus  the  Hanning  window  has 
virtually  the  same  bias  as  the  optimum  window  under  a  half-power-bandwidth 
constraint.  Further  comparisons  are  made  in  the  next  section. 

ROOT- M EA N-SQUAR E- BANDWIDTH  C ONSTRA INT 

The  rms  bandwidth  Br  of  spectral  window  W(f)  is  defined  as 

2  _/df  f2lw(t)l2 

r"  jSnwmi*  ' 

Inspection  of  (7)  and  (9)  immediately  reveals  that 

Di  -  B?  .  (28) 

Thus  if  the  rms  bandwidth  is  constrained,  bias  constant  Dj  is  fixed.  In  this 
case,  it  is  reasonable  to  resort  to  minimization  of  the  second  bias  constant  D2 
in  (9)  or  (11).  Thus,  we  wish  to  minimize  (11),  subject  to  constraints  (3),  (7), 
and 

/dt[w'(t)]2M2irBr)2  .  (29) 

The  solution  to  this  problem  is  presented  in  appendix  C.  The  optimum  data  win¬ 
dow  is 

wQ(t)  = — - -  cos3(irt/L),  |t|  S  L/2  .  (30) 

(5L)1/2 
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The  window  duration  L  must  be  chosen  as 

L.  -L  -L  . 

2  ^  Br 

according  to  (29).  The  minimum  value  of  D2  is 

_  25  «4 

°2  =  9"  ®r  • 

The  spectral  window  corresponding  to  (30)  is 

cos(wLf) 


Wo<f> 


il(5L)1/2 
15w 


(l-4L2f2)(l-4L2f2/9) 


(31) 


(32) 


(33) 


where  L  is  determined  from  (31).  The  decay  for  large  frequencies  is  according 
to  r4. 


For  comparison,  let  the  time  duration  of  the  Hanning  window  be  adjusted  to 
realize  the  specified  rms  bandwidth  Br.  Then  employing  (18)  in  (29),  we  find 
L  *  1  / ( yj3  Br),  and  (11)  yields  D2  *  3  Bp  .  Thus  the  Hanning  window  has 
8  percent  more  bias  than  the  optimum  window  under  an  rms  bandwidth  constraint, 
as  measured  by  bias  constant  D2. 

COMPARISON  OF  WINDOW  CHARACTERISTICS 

In  figure  2,  one-half  of  the  symmetric  optimum  data  windows  for  the  three 
bandwidth-constrained  cases  are  drawn  for  a  common  time  duration  of  L  =  1. 
The  equivalent-noise-bandwidth  data  window  (Hanning)  and  the  half- power- band¬ 
width  data  window  are  virtually  identical  and  are  continuous  in  value  and  deriv¬ 
ative  at  0.5.  The  rms-bandwidth  data  window  is  more  peaked,  and  goes  to  zero 
in  value,  in  derivative,  and  in  second  derivative  at  0.5.  Thus  the  last  window 
would  require  greater  overlap  than  the  first  two,  in  order  to  realize  the  same 
variance  reduction;  see  [6j. 

In  order  to  deduce  the  required  overlap  for  the  rms  bandwidth  data  window, 
the  quadratic  and  cubic  data  windows  (6,  pp.  10-18]  are  superposed  in  figure  3. 
Over  most  of  the  range,  the  quadratic  and  rms-bandwidth  windows  are  very  close. 
Near  the  end  of  the  range,  however,  the  taper  of  the  rms-bandwidth  window  ap¬ 
proaches  that  of  the  cubic;  in  fact,  both  are  continuous  in  second  derivative  at 
0.5.  Thus,  it  is  anticipated  from  earlier  results  [6,  table  4  ]  that  slightly  over 
65  percent  overlap  would  be  required  for  the  rms  bandwidth  data  window  to  realize 
99  percent  of  its  maximum  equivalent  degrees  of  freedom  [  4,  p.  22]. 
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2  Or 


Figure  2.  Three  Bandwidth- Cons  trained  Data  Windows; 
L  =  1,  Unit  Energy 


Figure  3.  Comparison  of  RMS-Bandwidth  Data  Window  with 
Quadratic  and  Cubic  Data  Windows;  L  =  1,  Unit  Energy 


9 


TR  4513 


The  spectral  characteristics  of  the  four  windows  derived  in  this  report  are 
presented  in  figures  4  through  7.  The  abscissas  on  every  plot  are  in  units  of  the 
half-power  bandwidth;  thus  all  the  curves  go  through  half-power  (-3.01  dB)  at 
f/Bh  =  0.5.  The  duration-limited  window,  (14),  is  plotted  in  figure  4  and  ex¬ 
hibits  relatively  slow  decay  with  frequency,  since  the  data  window  is  discontin¬ 
uous  in  derivative  at  its  edge.  The  equivalent-noise-bandwidth  (Hanning)  and 
half- power- bandwidth  spectral  windows,  plotted  in  figures  5  and 6,  are  virtually 
identical  and  have  good  decay  with  frequency,  since  the  data  windows  are  con¬ 
tinuous  in  derivative  at  their  edges.  The  spectral  window  for  the  rms-bandwidth 
data  window  is  plotted  in  figure  7  and  exhibits  very  rapid  decay  with  frequency. 
However,  as  noted  above,  by  virtue  of  requiring  greater  overlap  for  maximum 
variance  reduction,  this  window  will  require  somewhat  greater-size  FFTs  than 
do  the  other  windows. 


CONCLUSIONS 

The  Hanning  window  is  optimum  under  an  equivalent- noise-bandwidth  con¬ 
straint,  as  far  as  minimization  of  bias  constant  is  concerned.  Furthermore, 
it  is  near  the  optima  for  two  other  bandwidth  constraints.  Its  spi  ctral  decay  is 
also  sufficient  for  most  cases  that  the  bias  is  relatively  unaffected  by  distant 
spectral  peaks.  And  with  50  percent  overlap,  it  realizes  92  percent  of  the  max¬ 
imum  number  of  equivalent  degrees  of  freedom  [6,  table  6],  Thus,  the  Hanning 
window  is  a  reasonable  compromise  to  utilize  in  spectral  estimation  of  random 
stationary  data. 
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11 


Figure  4.  Spectral  Window  for  Duration- Limited  Data  Window 
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Figure  7.  Spectral  Window  for  RMS- Bandwidth  Data  Window 
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Appendix  A 

DERIVATION  OF  THE  OPTIMUM  DATA  WINDOW  FOR  AN 
EQUIVALENT-NOISE-BANDWIDTH  CONSTRAINT 

We  wish  to  minimize  (10)  (from  the  main  text),  subject  to  constraints  (3), 
(7),  and  (17).  Equations  (7)  and  (17)  are  integral  constraints  and  are  in  a  con¬ 
venient  form  for  a  calculus- of- variations  approach.  The  way  we  handle  (3)  is  to 
first  ignore  the  nonnegative  limitation;  then,  out  of  the  class  of  allowable  sol¬ 
utions,  we  restrict  attention  only  to  the  nonnegative  solutions  and  pick  the  best. 

In  order  that  (10)  be  finite,  w(t)  must  be  continuous.  Using  (3),  we  see  that 
this  means  that 


w(±L/2)  =  °.  (A*1) 

A  calculus-of-variations  approach  tells  us  to  minimize  the  quantity 

Q  «*tb)  +2\x$<*twk)y  (A- 2) 

where  X,  and  are  Lagrange  multipliers;  the  resulting  differential  equation 
for  the  optimum  window  is 


+  X,W#ft)  =  X*  ,  W  <  L/2’ 


(A- 3) 


We  employed  (A-l)  on  the  allowed  variations  in  deriving  (A— 3). 
The  general  solution  of  (A-3)  is 


v.(t) 


A  c©s(«t)+  B  3»n(o'0  4-  C 
A  C03V,(of)+  B  *«H(<rfc)  +  C  \  >  ^  <  L/2  •» 


(A-4) 


•F 


A  +  Bt 


where  ft  is  real  and  positive.  The  third  alternative  in  (A-4)  yields  the  trivial 
solution  when  (A-l)  is  imposed. 
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The  second  alternative  reduces  to 

K3|  *  A[cosk(tft)  -  cos  W  S  L/2.  (A-5) 


Since  A  can  be  chosen  to  satisfy  the  energy  constraint  (7),  and  L  can  be  chosen 
to  satisfy  the  bandwidth  constraint  (17),  there  is  left  only  the  variable  CL  in 
(A-5)  to  vary;  (i)  is  certainly  nonnegative  by  the  choice  of  proper  polarity 
for  A.  In  order  to  find  the  best  value  of  <X  for  this  second  alternative  of  (A-4), 
we  compute  D.  in  (10)  versus  C.  and  pick  the  minimum. 

To  accomplish  this  goal,  we  define 

Jjpl.*)  5  H(-W)  3  AM*))  M  -  (A“6) 


where 


and 


J)(uj  »  coshWu)  -CoshW,  l«l*  I 


s  W2. 

The  bandwidth  constraint  (17)  then  becomes 

a?  k.  -  b;*  , 


where 


K,  =  f  Juiu). 

-  I 


The  energy  constraint  (7)  becomes 

kZ  u 


where 


A  T  K,  - 

k  *  r'd 


The  window  constant  (10)  becomes 

*  2^  L“  ' 


(A-7) 

(A-8) 

(A- 9) 

(A-10) 

(A- 11) 

(A-12) 

(A- 13) 
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where 

K|  *  J  dtt  [A'/kJ  . 


(A- 14) 


If  we  eliminate  A  and  L  by  using  (A-9)  and  (A-ll),  then  (A-13)  becomes 


T>, 


Ki 


B. 


(A-15) 


The  quantities  necessary  in  (A-15)  follow  from  (A-7),  (A-10),  (A-12),  and  (A-14): 

K,  * 


(A-16) 


When  (A-16)  is  substituted  in  (A-15),  is  found  to  increase  monotonically  with 
increasing  K  ;  the  limit  as  o(-*  H  is 


US'  -a 
12ic' 


(A-17) 


The  first  alternative  in  (A-4),  when  subjected  to  boundary  condition  (A-l), 
breaks  into  two  subcases.  In  the  first,  if  9ih(a()  *  0,  then  "B  is  arbitrary 
Then  «<  *  Ktt,  K  >  I  The  function  with  smallest  D,  corresponds  to  K*l, 
and  yields 


W,  K)  sr  A[cos(>wt/L)  4  'il  +  8  SH»(2Trt/0,  W  *  Ll 2 


However,  we  must  have  B  s  0  ;  otherwise  W9lt)  would  go  negative  somewhere. 
Imposition  of  the  energy  constraint  (7)  and  bandwidth  constraint  (17)  yields 

W.W  *  cos*(iri/t)  ,1115  0,  «A-19» 

where 


L 


(A- 20) 
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The  corresponding  value  of  !Di  is 

_  I _  _  A  -pi 

•P|  S  3La  *  27  * 

which  is  smaller  than  (A-17). 


(A- 21) 


The  second  subcase  occurs  if  Sin(*)  *  0  •  Then  B  in  (A- 4)  must  be 
zero,  and  we  get 


%{t)  *  C0S^j  W  -  L/2> 


where  •<  k"rr  .  The  comments  and  method  immediately  below  fA-5)  apply 
equally  well  here.  Therefore  we  define 


J?h)  ~  Cos(<u)  -  COs(«d;  |tt|  S  I  ,  <  A  Ktt, 


(A-23) 


and  find 


(A- 24) 


K 


X 


1L 

■2  J  • 


When  (A-24)  is  substituted  in  (A-15),  p,  is  found  to  decrease  monotonically  with 
increasing  «<.  ,  at  least  for  o<  up  to  ir.  However,  when  s< > TT*.  M*)  becomes 
negative  somewhere;  this  may  be  seen  by  noting  that  Jn\i)  =  -«<  sin  (p<)  is 
positive  if  <>w.  Thus  the  limiting  member  of  this  subclass,  which  is  (A-19),  is 
the  optimum  window. 
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Appendix  B 

DERIVATION  OF  THE  OPTIMUM  DATA  WINDOW  FOR  A 
HALF-POWER-BANDWIDTH  CONSTRAINT 


Here  we  will  minimize  (10),  subject  to  constraints  (3),  (?),  and  (23),  In 
order  that  (10)  be  finite,  (A-l)  must  again  be  true.  A  calculus- of- variations 
approach  tells  us  to  minimize 

Q  =  +  whr)clO,  (B-1) 

where  „ 


cttfs  cos(u>t)-  k,  wsttBK)  a*  -2  , 


(B— 2) 


and  X,  and  are  Lagrange  multipliers ;  the  resulting  differential  equation  for 
the  optimum  window  is 


Wc"  It;  +  X,  W.Hrj  *  X2  eft),  Hr)  <  l/2  . 


(B-3) 


If  A,  is  negative,  the  form  for  wvfc)  includes  sinh  and  cosh  terms,  which 
lead  to  a  progressively  larger  value  of  D,  as  X,  becomes  more  negative, 
similar  to  the  result  of  appendix  A.  If  X,  is  zero,  the  only  solution  to  (B-3) 
and  (A-l)  is  the  trivial  solution.  If  X,  is  positive,  the  general  solution  to  (B-3) 


A  eosU*) >'*u 


A  CoS +  X*w 


(B-4) 


We  discard  the  odd  solutions  for  the  reason  given  in  the  footnote  to  (23). 

If  we  attempt  to  use  the  second  alternative  in  (B-4),  we  can  eliminate  A 
by  means  of  energy  constraint  (7).  However,  the  bandwidth  constraint  (23)  can 
not  be  satisfied  for  any  value  of  (a)L  ^  0  •  Therefore,  we  must  discard  the 

second  alternative. 

To  handle  the  first  alternative  in  (B-4)  conveniently,  we  define  a  function 


^  In)  «  w# (4*u)  + *^rj r  C05(K“)  “  y  &  >  W  <  l> 


(B-5) 
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where 


l*£  *  '>  < 


(B-6) 


Imposition  of  (A-l)  forces  (B-5)  into  the  form 

u*>  ■  c[^  -  ■  ci»,  w  > 

where  C  is  a  constant.  The  energy  constraint  (7)  yields 

C  -  (&f  ,  (b-8, 

where  we  have  used  (B-5)  and  (B-7),  and  defined 

K„  =  /«)•  (B-9) 

Satisfaction  of  bandwidth  constraint  (23)  demands  that 

0  -  jdu^osfrfiO-frUlu),  (B.10) 

where  we  employed  (B-6)  and  (B-7).  Substitution  of  the  detailed  form  for 
(B-7),  into  (B-10)  yields  the  relation 

0)5  Wi> [2S-5 w  -  2 fr2  +  £  J H*  +  SUW)  -  *  S K>}] 

»  ^cosW+S-ll-<f)][5 (•«}  +  *)  +•  ,  (B~U) 


(B-8) 


(B-9) 


where 


50*) 


Sin(x) 


(B-12) 


For  a  given  value  of  (B-ll)  must  be  solved  for  the  smallest  value  of  *  ;  fir 
is  a  known  specified  constant.  In  order  to  find  the  best  value  of  ^  ,  we  com¬ 
pute  3),  versus  \  and  pick  the  minimum,  always  being  careful  that  Jfci^remain 
nonnegative  for  all  Ms  i .  The  quantity 


(B-13) 


K,  " 


20 


TR  4513 


using  (B-5),  (B-7),  (B-8),  (B-6),  and  defining 

K,  s  JJuCAWI  .  (B-14) 

The  quantities  K*  and  K,  are  available  upon  substitution  of  (B-7)  in  (B-9)  and 
(B-14).  There  follows,  upon  use  of  (23), 

K,  «  C*[l+  3(2fp]  + 

+  C,C,[2&(2-f)5kl.)*,V  SK4")+  (B-15) 

K>  *  **<f  [c?{l-S(*i3  +  i  5W] 

+  “)  ‘  S(«^,+  "4],  (B-16) 

where 

Ci  *  s  ‘[£C05W  +  S-C'-tfl  •  (B-17) 

The  numerical  approach  may  now  be  summarized  as  follows,  A  value  of 
c ^  is  picked,  and  (B-ll)  is  solved  for  U  .  Then  (B-17)  is  computed  and  sub¬ 
stituted  in  (B-15)  and  (B-16),  thereby  enabling  evaluation  of  D,  in  (B-13)  for 
that  choice  of  0  .  (Up  to  this  point,  fir  could  be  any  desired  constant;  we  now 
restrict  fir  ■  )•  When  this  approach  is  tried,  it  is  found  that  D, 

increases  monotonically  with  increasing  ^  .  On  the  other  hand,  when  is 
made  too  small,  becomes  negative  near  u»| .  The  optimum  value  of  ^ 
is  realized  when  »0.  From  (B-7)  and  (B-17),  this  requirement  is 

C,  5m^)  +  C,  ^  swk)  -  o.  (B— 18) 

The  simultaneous  solution  of  (B-ll)  and  (B-18),  with  smallest  ^  ,  is  then  given 
by 

<p.»01l  H1fi  ,  <=  2.217  0 SIS.  (B-19) 

The  optimum  value  of  P,  then  follows  from  (B-13)  as 

T>,  -  .IfiC*  4*J0  ,  (B-20) 

and  the  segment  length  follows  from  (B-6)  as 
,  ^  i  _  i.Hii  423? 

L  ~  ^  '  (B_21) 
Finally,  the  optimum  window  follows  from  (B-5)  as 

£7j|  t  4.2*1  041 1  c^(«.434  ItH  VO  B  22) 

-  4.  337  3W  cos  (3. 5T2  2i  13  VO] ,  /V  S  L/*- 
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Appendix  C 

DERIVATION  OF  THE  OPTIMUM  DATA  WINDOW  FOR  AN  RMS- 
B  AND  WIDTH  CONSTRAINT 

The  problem  here  is  to  minimize  (11),  subject  to  constraints  (3),  (7),  and 
(29).  In  order  that  (11)  be  finite,  must  be  continuous.  Using  (3),  this  means 
that 

w(±L/l)=0,  w'(±L/i)*0  (C-l) 

A  calculus-of-variations  approach  tells  us  to  minimize  the  quantity 

Q  '*)]*  -I-  W*/f),  IC-2, 

where  X  and  ^  are  Lagrange  multipliers;  the  resulting  differential  equation  for 
the  optimum  window  is 

“  °’  W  '  L/ *■  (C‘3' 

We  employed  (C-l)  on  the  allowed  variations  in  deriving  (C-3). 


To  solve  (C-3),  we  assume  a  form  e*p(st)  for  Substitution  in  (C-3) 

requires  that  &  be  chosen  to  satisfy 

s4-  xw  =  o  ;  ±  U1-  <c-4> 

At  this  point,  several  alternatives  are  possible.  The  first  case  we  pursue 
is  a  negative  discriminant  in  (C-4).  Then  the  four  values  for  3  in  (C-4)  can  be 
expressed  as  ^ 

S  =  2,2  /  ~  2  ,  (C-5) 

where  *  is  a  complex  constant  with  nonzero  imaginary  part.  For  distinct  roots 
(i.  e. ,  tt  not  purely  real),  the  optimum  window  is 

W,  k)  '  h  i-B  4  Ce*p(-»t)  +  D  (C-6) 


In  order  that  (C-l)  be  satisfied  with  a  nontrivial  solution,  it  is  necessary  that 
the  determinant 

E  E*  */E  '/£* 

'/E  */£  £  £* 

i€  -i/e  -*/e'  (C_ 
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equal  zero,  where  E  *  exp(it./2.)  .  This  requires  that 

-  o.  «>«> 

But  none  of  the  factors  in  (C-8)  can  be  zero  without  double  roots  resulting  for  s, 
in  which  case  the  form  (C-6)  is  not  appropriate.  Therefore,  the  negative- 
discriminant  case  is  self-destroying. 

The  second  case  corresponds  to  a  zero  discriminant  in  (C-4).  Then  we  have 
three  subcases: 

r 

*  =  J  :?\;a  l  a  real  and  positive.  (C-9) 


[  ®»  °»w°>  0  J 

For  subcase  1,  the  form  for  is 


*0  (f)  =  f\  cosk  («t)  +-  B  Sink  (a't)  4*  C  t  Cosk  (at)  4"  Bt  5  ink  (at) .  ( c_ 


Imposition  of  (C-l)  requires  that 

sink l a l)  =  ±  aL 


(C-ll) 


for  a  nontrivial  solution.  But  (C-ll)  has  no  solutions  for  positive  real  q  . 


For  subcase  2  in  (C-9),  the  form  for  Xfel^is 

Hit)*  A  cas(ut)  4-B  3in(«t)  4  Ct  ®s(at)  4-J)t  sin(at). 

Imposition  of  (C-l)  requires  that 

5ui(ol)  *  ±  a  L 

for  a  nontrivial  solution.  Again  this  is  disallowed. 

For  subcase  3  in  (C-9),  the  form  is 

Hit)  =  A  4-  Bt  4  ctx+-£t3. 

Imposition  of  (C-l)  yields  only  the  trivial  solution. 


(C-12) 


(C-13) 


(C- 14) 


The  third,  and  last,  case  we  must  now  consider  is  a  positive  discriminant  in 
(C-4).  Then  we  have  three  subcases: 
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r  ±«,±r\ 

3-  l  ±a,  , 

I  ±iaytit 


A  and  £  real. 


(C-15) 


For  subcase  1,  we  have 


W#ffc)  =  A  cosKlcrO+Bs^loO^CcoiMt^  +  Ps'^^t).  (C-16) 


Imposition  of  (C-l)  requires  that 


where 


*  bafa  =  p  W(p)  °r  ' 

<5  AI-/2,  f  ?-  bL/2. 


(C-17) 


(C-18) 


The  only  solutions  of  (C-17)  are  mi  ;  that  is,  ft  -  b  .  However,  these  have 
been  considered  already  in  (C-10)  and  found  inadequate. 

For  subcase  2  in  (C-15),  we  have 

=4  «MV»U‘fc)  +  3  5»nV*(art)  +•  C  OJjUt)  vDsln^t).  (C-19) 


Satisfaction  of  (C-l)  demands  that 


’tfrnh  (k)  _ 

*  =  P 


(C-20) 


The  second  alternative  in  (C-20)  leads  to  odd  solutions  only,  in  (C-19),  and  they 
must  be  discarded  because  of  their  higher  variation  rate.  The  first  alternative 
in  (C-20)  leads  to 


j  n-1-  ffj»hua _ a*5  i^-1 ,  itu  t 

•*'  L  L  »•>>  w  J 


(C-21) 


which  is  a  legal  nontrivial  solution.  The  values  of  4  and  ^  are  related  as  shown 
in  figure  C-l.  To  handle  this  alternative  conveniently,  we  define  a  function 

JJ.IO  1  =  C[  *  C>/k);  Mis  I.  (C-22) 
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Figure  C-l.  Relationship  of  a  and  (J  in  (C-20) 


Then 


jjt'  W  =  xM>‘  (t*),  3."  W  wo’  ("H . 

The  energy  constraint  (7)  requires  that 

C  • 


where 


Kj  >1  [ig>00]2 


The  bandwidth  constraint  (29)  requires  that 

L  *  TT  \lW 


(C-23) 


(C-24) 


(C-25) 


(C-26) 


where  we  have  employed  (C-23),  (C-24),  and  (C-25).  Then  we  use  (C-23)  through 
(C-26)  to  determine  the  bias  constant  1>t  as 

•>.  _ I _  f  VL  r.  N  T>^  on  1 


*  *  = < B" 


(C-27) 


For  the  curren  example  in  (C-22),  we  evaluate 


C osjfw 


Zos'lf) 
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where  we  have  employed  the  first  alternative  in  (C-20),  and  defined 


(C-  28) 

(C-29) 


For  an  arbitrary  V  ,  we  solve  the  first  alternative  in  (C-20)  for  ^  , 

and  then  compute  Da  ,  by  means  of  (C-27)  and  (C-28).  It  is  found  that  ID, 
increases  monotonically  with  o*  .  The  minimum  is  realized  when  of  *  o  , 
namely 


(C-30) 


The  third  subcase  in  (C-15)  yields  the  form 

W A  C*s(at)  +  3  («*)  +  C  +T>  s'm  ^  •  (C-31) 

The  boundary-  conditions  (C-l)  demand  that 

»>  °r  ‘c'32> 

However,  the  second  alternative  in  (C-32)  yields  odd  solutions  and  is  discarded. 
The  first  alternative  yields 

c 

The  values  of  and  i  are  related  as  shown  in  figure  C-2.  As  above, 
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we  define  a  function 


=  w.(^u)  =  cMO  =  c[- 


cos^u) 


jfDSfeu) 

"C8SW" 


coaK) 

We  can  now  employ  (C-25)  through  (C-27)  immediately.  We  evaluate 


l 


(C-34) 


K  *  J-+SM  -2  -  + 

6  ces*^  cos  W  K.os(p) 


C052(v)  »  Cts(p)  I 


\  +  5ty) 

testy) 

I- Sty) 

COi2 (?) 


(C-35) 


K , <_i±3lv)  |./  . 

1  «**W  r  cos  (*)  c*s(p)  \  costy) 

For  an  arbitrary  X  (  ^  ir/ z)  ,  we  solve  the  first  alternative  in  (C-32)  for 
Q  ,  and  then  compute  1>a  using  (C-27)  and  (C-35).  It  is  found  that 
decreases  monotonically  with  increasing  •/  ,  at  least  for  up  to  tt/2  ,  the 
limit  being  25/9  at  ^  =  tr/z  “  .  However,  when  X  >  tt/2  ,  J^a)  goes 

negative  somewhere  and  is  unacceptable.  For  *  rr/z  ,  (C-32)  is  not  an 
adequate  form;  we  note  instead  that  fi  =  3ir/z  from  (C-31),  and  then 

The  boundary  conditions  (C-l)  force  A  -  -3B  ,  giving 

J( ('U  -  C053(¥«). 

We  then  find 


(C-36) 


(C-37) 


«f’  K- 


Vrrl 

32  > 


K  = 


45V 

12? 


(C-38) 


yielding 


"n  -  'R4 

Px  =  ~o  Dr  > 


(C-39) 


which  is  smaller  than  (C-30).  Thus  (C-37)  is  the  optimum  window.  (Notice  that 

JC( i)  =>  o  ) 
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An  Approximate  Fast 
Fourier  Transform 
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Spectral  Analysis 


Albert  H.  Nuttall 

ABSTRACT 


An  approximate  and  quick  fast  Fourier  transform 
technique  for  vernier  spectral  analysis  is  derived 
and  tested  for  several  candidate  time  and  delay 
weightings,  and  for  overlaps  of  the  time  weightings. 
For  50  percent  overlap,  the  use  of  a  simple  cosine 
lobe  for  the  time  weighting  yields  spurious  spectral 
sidelobes  at  least  23  dB  below  the  main  peak,  where¬ 
as  Dolph-Chebyshev  time  weighting  achieves  -33  dB 
sidelobes.  For  75  percent  overlap,  use  of  a 
(cosine)S  lobe  for  the  time  weighting  yields  side¬ 
lobes  at  least  54  dB  down,  whereas  Dolph-Chebyshev 
time  weighting  achieves  -86  dB  sidelobes.  In  both 
cases  of  overlap,  use  of  delay  weighting  is  also 
required  and  is  taken  as  a  Hanning  weighting.  Ex¬ 
tensions  to  other  overlaps  and , weightings  are 
possible. 


Approved  for  public  release;  distribution  unlimited. 
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AN  APPROXIMATE  FAST  FOURIER  TRANSFORM  TECHNIQUE 
FOR  VERNIER  SPECTRAL  ANALYSIS 

INTRODUCTION 


To  detect  the  presence  of  very  narrowband  weak  signals  in  noise,  and  to 
measure  their  center  frequencies  accurately,  it  is  necessary’  to  Fourier  trans¬ 
form  a  long  time  segment  of  the  available  process.  When  the  center  frequencies 
of  the  signal  components  are  unknown  and  the  total  search  bandwidth  of  interest 
is  large,  this  procedure  demands  storage  and  computation  of  many  degrees  of 
freedom,  that  is,  search  of  a  large  time-bandwidth  product  space.  It  would  be 
advantageous  if  a  quick,  coarse  search  for  narrowband  components  could  be 
conducted,  followed  by  a  finer  vernier  analysis  over  a  limited  band  where  the 
presence  of  narrowband  components  has  been  indicated.  Such  an  adaptive  pro¬ 
cedure  would  be  less  time-consuming  and  require  less  storage.  Also,  if  the 
procedure  did  not  need  to  be  exact,  but  yielded  an  approximation  with  accept¬ 
able  sidelobes,  the  required  storage  and  computation  might  be  reduced  further. 

This  report  presents  just  such  a  technique,  which 

1.  accepts  the  input  process  in  smaller  time  segments  as  they  are  available, 

2.  performs  a  reasonable-size  weighted  fast  Fourier  transform  (FFT)  on 
each  overlapped  segment, 

3.  stores  only  that  frequency  portion  (at  each  segment)  where  narrowband 
components  are  indicated  to  be  present,  and 

4.  performs  a  small-size  weighted  FFT  over  the  total  data  record  available, 
for  each  frequency  bin  stored. 

Steps  1  and  2  permit  smaller-size  FFTs  than  would  be  required  if  the  total  data 
record  were  spectrally  analyzed  in  one  operation.  Steps  3  and  4  constitute  the 
adaptive  feature  of  this  technique.  The  last  transform  over  time  (delay)  in  step  4, 
for  each  frequency  bin,  is  a  vernier  frequency  analysis,  measured  from  the 
center  of  each  bin;  the  degree  of  approximation  of  this  technique  is  the  subject 
of  this  report. 
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Some  past  work  on  performing  large-size  FFTs  by  means  of  several  smaller 
FFTs  is  reported  in  references  1  and  2.  The  methods  reported  there  are  exact, 
but  they  consume  more  time  and  require  more  storage  than  the  method  to  be 
presented  here.  In  particular,  the  two  methods  of  reference  1  require  too  many 
small-size  FFTs,  and  the  method  of  reference  2  requires  additional  multiplica¬ 
tions  by  complex  exponentials  and  a  fair  amount  of  storage.  The  approximate 
technique  of  reference  3  is  similar  to  the  one  outlined  above,  up  to  step  4,  with 
the  notable  exception  of  overlapped  weighting;  at  that  point  the  technique  of  ref¬ 
erence  3  requires  transformation  back  to  the  time  domain  followed  by  another 
transform  to  the  desired  frequency  domain.  Additional  transforms  are  required 
in  this  last  technique,  and  it  produces  greater  sidelobes  than  the  new  technique, 
especially  when  the  temporal  weighting  is  judiciously  selected. 

FUNDAMENTAL  SPECTRAL  RE LATIONSHIPS 
LARGE-SIZE  FFT  APPROACH 

Before  embarking  on  the  approximate  technique,  we  review  the  standard 
large-size  FFT  approach  to  spectral  analysis.  Suppose  a  data  waveform  x(t) 
is  sampled  at  time  instants  nA,  n  integer.  Then  the  voltage  density  spectrum 
that  can  be  computed  is* 

V(f)  s  f  dt  exp  (-i2wft)  x(t)  u(t)  a  a  (t) 

4  dt 

=  A  £  exp  (-i2irfnA)  x(nA)  u(nA)  , 
n 

where  u(t)  is  a  temporal  weighting  deliberately  imposed  to  control  spectral 
sidelobes,  as  will  be  discussed  shortly;  see  figure  1A.  The  finite  duration  of 
u(t)  terminates  the  integral  and  sum  in  equation  (1)  at  finite  limits.  The  im¬ 
pulse-train  function  in  (1)  is  defined  as  the  infinite  sum 

(t)  s  £  «  (t  -  nA)  .  (2) 

n 

The  integral  representation  in  (1)  allows  us  to  expresst 

V(f)  =  X(f)  •  U(f)  •  $£(f)  ,  (3) 


•All  integrals  are  over  the  range  of  nonzero  integrand. 

^The  Fourier  transform  of  the  lower-case  time  function  x(t)  is  the  upper¬ 
case  frequency  function  X(f);  this  notation  is  used  throughout. 
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where  •  denotes  convolution,  and,  in  keeping  with  (2), 

£»(<)=  ES(f- T>-  <4> 

m 

Thus,  the  observed  spectrum  V(f)  is  the  convolution  of  X(f)  with  the  set  of 
windows*  U(f)  •  fiy^f),  which  is  depicted  in  figure  IB. 


Figure  1A.  Temporal  Weighting 


Because  the  ideal  spectral  window  is  a  single  impulse  at  f  =  0,  the  aliased 
mainlobes  at  m/A,  m  4  0,  are  undesired.  Also,  the  window  U(f)  is  desired 
to  be  narrow,  with  very  small  side  lobes.  Since  the  weighting  u(t)  is  of  limited 
duration,  the  mainlobe  width  of  U(f)  is  not  zero,  but  is  inversely  proportional 
to  the  time  duration. 


Now,  if  the  voltage  density  spectrum  V(f)  is  computed  at  multiples  of 
(LA)'1,  where  LA  is  the  time  duration  of  u(t),  we  obtain 


V 


£  exp  (-i2wnp/L)  x(nA)  u(nA)  , 
n 


p  integer  . 


(5) 


♦In  the  time  domain,  u(t)  is  called  a  weighting;  the  corresponding  Fourier 
transform  in  the  frequency  domain,  U(f),  is  called  a  window.  This  nomencla¬ 
ture  is  used  throughout. 
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Since  V(f)  is  periodic  of  period  1/A  (see  (1)),  (5)  need  be  computed  at  L 
different  points;  thus  it  can  be  realized  as  an  L-point  FFT  of  sequence 
|x(nA)  u(nA)} .  For  fine  frequency  analysis  (that  is,  large  LA)  the  size  L  of 
the  FFT  may  be  too  large  to  compute  easily,  under  storage  and  time  limitations. 
The  values  in  (5)  are  samples  of  the  convolution  of  figure  IB  with  voltage  density 
spectrum  X(f). 

APPROXIMATE  FFT  TECHNIQUE 


Just  as  we  started  above  with  an  integral  definition  of  a  spectrum,  then 
showed  that  samples  of  this  spectrum  were  attainable  with  an  FFT,  we  begin 
with  the  spectral- delay  function,  a,  defined  as 


a(f,r)  =  /  dt  exp  (-i2»ft)  x(t)  w(t  -  f)  A6.(t) 

(6) 

=  A  53  exP  (~i2»fnA)  x(nA)  w(nA  -  r  )  . 
n 

The  temporal  weighting  w  is  now  delayed  by  r  seconds;  if  the  duration  of  w 
is  seconds,  the  function  w(t  -  r)  picks  out  a  delayed  portion  of  data  x 
of  length  L w ,  and  subjects  it  to  the  same  transform  as  in  (1).  This  operation 
is  depicted  in  figure  2A,  where  the  temporal  weighting  can  be  located  at  aa, 
bb, ....  cc.  This  figure  is  drawn  for  50  percent  overlap  of  the  temporal  weight¬ 
ings;  however,  other  overlaps  are  possible  and  recommended  in  some  cases. 

The  next  step,  consistent  with  step  4  in  the  Introduction,  is  to  perform  a 
Fourier  transformation  on  the  delay  variable  r,  while  holding  frequency  vari¬ 
able  f  fixed.  The  general  definition  is  the  vernier  spectrum 


Y(ftp)c  /dr  exp  (-i2w  v  r)  a(f,  r )  d(r)  S  6^. ( r ) 

=  S  53  exP  (~12»i»kS)  a(f,  kS)  d(kS)  , 
k 


(7) 


where  »  is  the  vernier  frequency,  d(r)  is  called  the  delay  weighting,  and 
S  is  the  separation  increment  in  delay  r  at  which  a(f,  r)  must  be  computed; 
that  is,  S  is  the  shift  between  temporal  weighting  locations  <see  figure  2A>. 
Since  the  separation  S  in  delays  can  be  taken  to  be  smaller  than  the  temporal 
weighting  duration  Ly,- ,  (7)  allow  s  for  overlapped  weighted  transformation  of 
the  available  data  (see  (6)). 
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The  operation  described  by  (7)  is  depicted  in  figure  2B.  When  the  Fourier 
transform  (6)  on  weighted  time  segment  aa  is  completed,  the  set  of  frequency 
components  denoted  by  the  vertical  line  of  Xs  at  the  end  of  segment  aa  are 
available.  Similarly,  frequency  component  values  at  the  ends  of  segments  bb, . . . , 
cc  are  indicated  in  figure  2B;  these  components  correspond  to  delayed  locations 
of  the  temporal  weighting  w.  Now,  for  a  fixed  frequency,  say  fj  ,  the  array 
of  (delayed)  frequency  components  indicated  in  a  horizontal  box  in  figure  2B  is 
subjected  to  a  delay  weighting  and  is  Fourier  transformed  according  to  (7), 
thereby  yielding  vernier  spectrum  Y(fj,  »).  Similar  outputs  are  available  for 
other  (adjacent)  frequencies  of  interest,  such  as  f£  or  fg  . 


Figure  2A.  Overlapped  Temporal  Weightings 


F'BQuenCV 


a  b  «  b  c  c 


Figure  2B.  Adjacent  Delay  Weightings 
Figure  2.  Temporal  and  Delay  Weightings 
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Samples  of  the  vernier  spectrum  in  v  at  multiples  of  (MS)"  ,  where  MS 
is  the  duration  of  delay  weighting  d(r),  are  given  by 

Y  ( f,  — )  =  S  5"'  exp  (-i2»mk/M)  a(f,kS)  d(kS),  m  integer,  (8) 
V  MS/  k 


which  can  be  realized  as  an  M-point  FFT  of  the  sequence  ja(f,  kS)  d(kS)f  of 
length  M.  The  periodicity  of  Y(f,  »)  in  »  ,  of  period  1/S  (see  (7)),  means 
that  (8)  need  be  computed  only  at  M  different  values  of  m. 


Finally,  samples  of  vernier  spectrum  Y  in  frequency  f  at  multiples  of 
(NA)-1  are  given  by  (using  (8)) 

/  P  m  \  M-l  /  P  \ 

Yl - ,  - I  =  S  £  exP  (~i2»mk/M)  a( -  ,  kS]  d(kS)  , 

\NA  MS  /  k=0  \NA  /  ... 

p  =  0, 1, .  . . ,  N  -  1;  m  =  0, 1, . . . ,  M  -  1  , 


where  delay  weighting  d(r)  has  been  selected  so  that  samples  |d(kS)|  are 
nonzero  only  for  k  =  0, 1, .  . . ,  M-l.  The  values  of  a  needed  in  (9)  are  (using 
(6))  given  by 


a 


=  A 


£  exp  (-i2»pn/N)  x  (nA)  w(nA 
n 


-  kS)  . 


(10) 


In  order  to  put  (10)  directly  in  the  form  of  a  standard  FFT,  we  assume  that  the 
delay  separation  S  is  taken  as  an  integer  multiple  of  the  sampling  increment  A 


S  =  Is  A  .  (11) 

Then,  if  temporal  weighting  w  has  nonzero  samples  jw(nA)}  only  for  O^n^  N-l, 
(10)  becomes 

/P  \  N-l 

a| —  ,  kSJ  =  exp(-i2»pkIs/N)  A  £  exp  (-i2rrpm/N)  x(mA  +  klg  A)  w(mA), 
\na  /  m=0 

(12) 

0  <  p  <  N  -  1,  0  <  k  <  M  -  1  . 
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The  exponential  phase  factor  preceding  the  sum  (FFT)  in  (12)  takes  on  particu¬ 
larly  simple  forms  for  two  special  cases  of  delay  separation  S:  For  Is  =  N/2, 
delay  S  is  equal  to  half  the  temporal  weighting  duration  Lw-  and  is  termed 
50  percent  overlap;  for  I  =  N/4,  delay  S  is  one-quarter  of  andis termed 
75  percent  overlap.  For  these  two  cases, 

50  percent  overlap,  Is  =  N/2,  phase  factor  =  (-l)plc  ;  (13A) 

75  percent  overlap,  Is  =  N/4,  phase  factor  =  (-i)P^  .  (13B) 

By  proper  branching  in  a  computer  program,  no  storage  or  complex  multiplica¬ 
tions  are  necessary  to  incorporate  these  phase  factors  in  (12),  prior  to  its 
usage  in  (9).  (An  alternative  approach  that  completely  circumvents  the  phase 
factor  in  (12)  is  described  in  appendix  A. ) 

Equations  (12)  and  (9)  are  the  essential  results  of  interest.  We  now  inter¬ 
pret  them  by  means  of  simple  examples  that  will  enable  us  to  make  good  choices 
of  temporal  weighting  w,  delay  weighting  d,  and  separation  (overlap)  S. 

INTERPRETATION  OF  THE  VERNIER  SPECTRUM 


In  appendix  B,  the  vernier  spectrum  is  shown  to  be  given  in  terms  of  X  by 


Y (f»  v ) 


D(»0  •  ^{v)  , 


(14) 


where  all  the  convolutions  are  on  v,  with  f  held  fixed.  D(v)  is  the  delay 
window  corresponding  to  the  delay  weighting  d(r). 

The  linearity  of  the  two  Fourier  transforms,  (6)  and  (7),  on  the  data  x(t) 
indicates  that  we  can  investigate  the  behavior  for  data  components  separately 
and  merely  add  the  results.  The  fundamental  component  is 

x(t)  =  exp  (i2»f0t),  X(f)  =  S(f  -  fo)  .  (15) 

At  this  point,  we  shall  make  a  series  of  reasonable  assumptions  and  require¬ 
ments,  and  deduce  desirable  properties  about  the  weightings  and  separations. 
The  first  assumptions  are 

(a)  excitation  frequency  fQ  <  (2  A)-1, 

(b)  coarse  analysis  frequency  f  <  (2A)-1, 

(c)  >>  a  . 
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Assumption  (a)  avoids  aliasing,  (b)  restricts  analysis  to  the  fundamental  range, 
and  (c)  requires  the  temporal  weighting  to  cover  many  samples  of  the  process 
x(t) .  Furthermore,  if 

(d)  temporal  window  W  has  low  sidelobes, 

the  only  term  in  the  sum  in  (14),  after  substituting  (15),  that  has  substantial 
value  is  that  for  m  =  0,  and  it  yields 

Y(f,  v)  3  [w(f  -  fQ)  D(u  +  f  -  f0)]»  typ)  .  (16) 

A  plot  of  this  equation  versus  vernier  frequency  v  is  given  in  figure  3,  where 
Ld  is  the  length  (duration)  of  delay  weighting  d(r).  The  narrow  lobe  at  v  =  fQ  -  f 
is  the  desired  component;  this  component  is  displaced  from  the  coarse  analysis 
frequency  f  (corresponding  to  v  =  0)  by  fo  -  f  Hz,  which  places  it  at  absolute 
frequency  f  +  (fQ  -  f)  =  fQ,  as  desired.  The  shape  of  this  lobe  is  governed  by 
the  delay  window  D;  thus,  if 

(e)  Ld  »  S  , 

(f)  delay  window  D  has  low  sidelobes, 

the  large  lobes  separated  by  l/S  Hz  in  figure  3  will  not  overlap  significantly, 
and  potentially  good  spectral  estimation  is  possible.  The  necessity'  of  delay 
weighting  is  made  obvious  by  these  observations. 
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There  are  a  few  additional  points  worth  noting  about  figure  3.  The  peak  height 
of  the  lobes,  W(f  -  f^  D  (0),  is  a  function  of  the  exact  location  of  the  excitation 
frequency  fo  and  the  coarse  analysis  frequency  f.  This  undesirable  picket- 
fence  effect*  (which  was  not  present  in  figure  1)  can  be  minimized: 

(g)  choose  analysis  frequencies  jfjJ  closely  spaced  (see  figure  2B). 

Then  If^-  fQ|  is  small  for  some  value  of  k.  Also,  since  the  width  of  the  lobes 
in  figure  3  is  1/l^j,  where  is  the  delay  weighting  duration  and  will  betaken 
of  the  order  of  the  total  record  length  available  or  utilized,  very  fine  resolution 
in  v  is  possible.  Hence,  narrowband  components  closer  than  1/Lw,  the  resolu¬ 
tion  capability  of  a  single  time  segment,  can  be  resolved  by  using  this  technique. 

FFT  CONSIDERATIONS 


Samples  of  the  vernier  spectrum  Y(f,  v)  were  given  in  (9).  The  locations 
of  the  samples  aret 


1  2  N-l 

f:  0,  -  ,  -  ,  . . . ,  - 

NA  NA  NA 


1  2  M-l 

v,  0,  ■  ,  '  »  •  •  • »  — — 

MS  MS  MS 


(17) 


The  range  covered  by  the  vernier  frequency  v  is  S”1,  and  will  be  greater  than 
the  increment  in  f,  which  is  (NA)“1,  if  overlapped  temporal  weighting  is  used. 
And  since  the  full  range,  S~l,  would  encompass  a  spurious  lobe  for  values  of 
I  fo  -  f|  near  (2S)_1  (see  figure  3),  overlapping  is  necessary. 

The  approach  adopted  here  is  to  utilize  ail  the  samples  in  f  at  separations 
of  (NA)  ,  and  use  only  samples  in  v  which  cover  a  range  of  (NA)-*;  that  is, 
we  use  the  central  portion  of  Y  centered  around  v  =  0,  including  negative  fre¬ 
quencies.  In  terms  of  figure  2B,  adjacent  delay  weightings  at  fj,  io,  f3  will  be 
employed.  The  alternative  time-saving  procedure  of  attempting  to  utilize  all  of 
the  M  samples  in  v ,  and  using  only  enough  samples  in  f  to  fill  in  the  frequency 
axis,  can  lead  to  a  very  bad  picket-fence  effect,  in  addition  to  large  spurious 
lobes  at  undesired  frequency  locations.  These  conclusions  follow  upon  piecing 
together  several  vernier  spectra  like  figure  3  for  appropriate  values  of  f  and 
excitation  frequency  f  . 


*  Sec  reference  4,  page  47. 

f  the  upper  half  of  the  array  of  numbers  in  (17)  corresponds  to  negative  fre- 
,  .•  rhu.»  the  last  samples  in  each  array  correspond  to  f  =-(NA)-^  and 
• .  respectively. 
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EXAMPLES 

The  general  guidelines  furnished  in  the  previous  section  do  not  yet  enable 
us  to  make  quantitative  selection  of  good  weightings  for  different  degrees  of 
overlap.  To  make  this  selection,  several  examples  are  considered  and  compared. 
The  numerical  examples  utilize 

1  11 

A  =  - seconds,  N  =  1024,  —  =  —  Hz, 

1024  MS  8 

(18) 

f  =  256  /  — )  256  —  Hz  . 

°  W  2 

(A  sample  program  utilizing  (9),  (10),  (11),  (18),  and  the  method  of  appendix  A 
is  given  in  appendix  C  for  75  percent  overlap. ) 

50  PERCENT  OVERLAP 

At  50  percent  overlap  of  the  temporal  weighting,  *  several  possibilities  were 
tried.  They  included 


cosine  lobe  :  w(t)  =  cos  (wt/L^y) 
cosine2  lobe  (Hanning)  ;  w(t)  =  cos2  (wt/L^y) 
Dolph-Chebyshev  (Reference  5)t 


|t|  <  L.y/2  . 


(19) 


A  complete  list  of  cases  is  presented  in  table  1. 

In  figures  4A  through  41, $  decibel  plots  of  the  magnitude  of  the  estimated 
spectrum  are  given  for  cosine  temporal  weighting  and  for  (cosine)2  delay 
weighting.  All  plots  are  normalized  with  respect  to  a  maximum  of  0  dB,  which 
occurs  for  fo  =  f  =  256  Hz,  v  -  0  Hz.  Figure  4A,  for  example,  demonstrates 
the  behavior  predicted  by  figure  3,  namely,  the  presence  of  spurious  sidelobes 
every  S-1  =  (.5xl  s)“l  =  2  Hz.  The  largest  spurious  lobe  in  figure  4A  is 
-23.  5  dB  at  258  Hz.  The  slow  rate  of  decay  of  the  peaks  at  256  +  2n  Hz  is  due  to 
the  discontinuity  of  slope  of  w(t)  at  +  Lw/2  for  this  example.  The  desirable 
feature  of  a  narrow  mainlobe  is  attained,  as  indicated  in  figure  4.  The  succes¬ 
sion  of  plots  in  figure  4  shows  that  the  extent  of  the  picket  fence  varies  greatly 

♦When  these  weightings  are  employed  in  the  FFT,  they  are  delayed  by  Lvy/2 
seconds,  thereby  being  nonzero  in  the  interval  (0,  I^y). 

^A  quick  and  accurate  method  of  generating  the  Dolph-Chebyshev  weights  by 
means  of  efficient  use  of  an  FFT  is  presented  in  reference  6. 

^Figures  4  through  14  follow  the  text,  beginning  on  page  16. 
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Table  1.  Examples  of  Temporal  and  Delay  Weightings 


Figure 

Temporal 

Weighting 

Delay  Weighting 

Overlap 

(%> 

Number  of  Tones 

4 

cosine 

cosine2 

1 

5 

2 

cosine 

cosine2 

50 

1 

6 

Dolph-Chebyshev 

cosine2 

1 

7 

2 

cosine^ 

0 

cosine* 

75 

1 

8 

cosine 

cosine2 

75 

1 

|  9 

4 

cosine 

cosine2 

75 

1 

cosine5 

cosine2 

75 

1 

11 

Dolph-Chebyshev 

cosine2 

75 

1 

12 

cosine2 

flat 

75 

1 

13 

cosine2 

cosine2 

75 

2 

14 

cosine^ 

flat 

75 

2 

with  excitation  frequency,  reaching  a  maximum  of  -3.  20  dB  in  figure  4H  for 
fo  =  256  7/16  Hz.  (The  figures  for  fo  >  256  1/2  Hz  repeat  the  behavior  shown. ) 
The  worst  sidelobe  of  -23.0  dBoccursfor  fo  =  256  1/8  Hz,  as  shown  in  figure  4C. 

It  should  be  noted  that  if  sidelobes  were  to  be  measured  with  respect  to  the 
peak  on  that  sameplot,  figure4C  would  yield  a  sidelobe  of -23.  0+ 0.  13  =  -22.  9  dB. 
Thus,  the  convention  adopted  here  must  be  kept  in  mind  in  the  following  discussion. 

Instead  of  applying  the  weighting  directly  in  the  time  domain  by  means  of 
multiplication  on  the  data  x,  the  effect  of  cosine  weighting  can  be  accomplished 
in  the  frequency  domain  by  means  of  convolution  of  the  spectrum  with  the  se¬ 
quence  (i/2)  {l,  -if;  however,  the  resultant  must  be  interpreted  as  the  spectral 
value  between  the  two  quantities  convolved  at  each  frequency  step.  *  More  gen¬ 
erally,  (cosine)11  time  weighting  can  be  accomplished  alternatively  by  means  of 
convolution  of  the  (unweighted)  spectrum  with  the  sequence 

(t)  j1,  ~n*  (2)' "  (3) . (_1)  (20) 


♦This  possibility  and  its  interpretation  were  pointed  out  by  Dr.  N.  L. 
Owsley. 
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of  length  n  +  1,  and  then  interpreted  as  the  spectral  value  at  the  center  of  the 
region  convolved,  for  each  frequency  step;  see  appendix  D.  The  convolutional 
sequences  in  (20)  are  given  in  table  2  for  n  =  1  through  5. 


Table  2.  Convolution  Sequences 


n 

Convolution  Sequence 

1 

i  1/2  1 1,  -if 

2 

1/4  |-1,  2,  -if 

3 

i  1/8  {-1,  3,  -3,  if 

4 

1/16 1  1,  -4,  6,  -4,  if 

5 

i  l/32f  1.  "5,  10,  -10,  5,  -if 

Since  the  effect  of  (cosine)^  temporal  weighting  is  very  easy  to  incorporate 
in  the  frequency  domain  by  means  of  convolution,  it  must  be  considered  as  a 
candidate  for  weighting.  The  results  in  figures  5A  and  5B  show  that  although 
the  picket  fence  is  reduced  to  -2.  70  dB,  the  peak  sidelobe  increases  to  -15.4dB. 
(For  brevity,  we  are  now  presenting  only  selected  cases  of  worst  excitation 
frequencies. )  The  reason  for  the  increased  sidelobes  for  this  temporal  weight¬ 
ing  is  that  50  percent  overlap  is  not  yet  great  enough  to  realize  the  deeper  first 
sidelobe  level  of  -31.  5  dB;  that  is,  we  are  still  sampling,  according  to  figure  3, 
on  the  skirts  of  the  mainlobe  for  some  excitation  frequencies.  Generally,  for 
50  percent  overlap,  the  peak  sidelobe  will  occur  approximately  at  the  excitation 
frequency  such  that  the  worst  sidelobe  (or  mainlobe)  of  the  temporal  window 
beyond  f  =  1.  5/Lw  is  encountered;  this  may  be  seen  by  considering  figure  3 
and  recalling  that  we  plot  only  the  central  portion  of  Y(f,  v).  Thus  the  (cosine)n 
weightings  in  tables  1  and  2  for  n  >  2  are  not  acceptable  for  50  percent  overlap, 
since  sampling  of  the  mainlobe  is  encountered. 

The  realization  of  minimum  sidelobe  level  for  a  specified  beamwidth  (to  the 
first  null)  is  exactly  the  problem  addressed  by  Dolph,  reference  5.  Accordingly, 
this  weighting  is  of  considerable  importance  in  spectral  estimation.  In  figures  6A- 
6C,  the  effects  of  Dolph-Chebyshev  time  weighting  are  presented.  The  worst 
sidelobe  of  -33.  2  dB  occurs  for  fQ  =  256  1/8  Hz  (figure  6C).  These  results  are 
noticeably  better  than  in  figures  4  and  5. 

When  triangular  temporal  weighting  was  tried,  it  had  a  peak  sidelobe  of 
-20.  2  dB;  again,  we  are  samplingthe  skirts  of  the  mainlobe.  Thus,  if  50  percent 
overlap  is  all  that  can  be  utilized  for  some  applications,  due  perhaps  to  limited 
computation  time,  the  cosine-lobe  temporal  weighting  is  the  best  of  the  simply 
applied  windows  (that  is,  by  means  of  frequency  domain  convolution),  but  the 
Dolph-Chebyshev  time  weighting  is  10  dB  better  than  the  cosine  lobe  weighting. 
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75  PERCENT  OVERLAP 

At  75  percent  overlap  of  the  temporal  weightings  the  following  examples 
were  utilized: 

cosine2  lobe  (Hanning) 
cosine3  lobe 
cosine4  lobe 
co sine 5  lobe 
Dolph- Cheby shev 

The  results  for  the  Hanning  weighting  are  given  in  figure  7.  The  peak  sidelobe 
is  -41.  8  dB  at  fo  =  256  1/2  Hz  in  figure  7C,  and  the  picket  fence  is  -2.  60  dB 
at  fo  =  256  7/16  Hz  in  figure  7B.  Thus,  a  much  improved  sidelobe  level  is 
realized  relative  to  50  percent  overlap,  at  the  expense  of  increased  computa¬ 
tion  effort,  that  is,  increased  overlap  and  number  of  FFTs. 

In  an  effort  to  further  improve  performance,  the  (cosine)3  weighting,  which 
has  a  higher  degree  of  continuous  derivatives  at  +  Lw/2,  was  tried.  The  re¬ 
sults  in  figure  8  show  a  maximum  sidelobe  of  -51.  2  dB  and  a  picket  fence  of 
-2.34  dB. 


w(t)  =  cos2  (rt/Lw) 
w(t)  =  cos3  ( nt/Lw ) 
w(t)  =  cos4  (trt/Lw) 
w(t)  =  cos3  (rt/Lw) 


|t|  <  1^/2  .  (21) 


Continuation  of  this  effort  to  smoother  weightings  such  as  (cosine)4,  figures, 
shows  a  peak  sidelobe  of  -48. 1  dB  and  a  picket  fence  of  -2. 18  dB.  The  peak 
sidelobe  has  increased  over  that  for  (cosine)3  weighting  because,  for  75  percent 
overlap,  the  peak  sidelobe  (or  main  lobe)  of  the  temporal  window  beyond  approxi¬ 
mately  f  =  3.  5/Lw  is  encountered.  It  so  happens  that  the  worst  case  in  this 
region  is  larger  for  (cosine)4  than  for  (cosine)3  temporal  weighting. 

For  (cosine)3  temporal  weighting,  the  peak  sidelobe  is  reduced  further  to 
-54. 1  dB.  Also,  the  picket  fence  is  improved  to  -2.  07  dB;  see  figure  10.  This 
weighting  is  easily  accomplished  via  frequency  domain  manipulations;  see  table  2. 
An  alternative  temporal  weighting  of  virtually  equal  quality  to  (cosine)5  is  cubic, 
that  is,  sections  of  cubic  curves  that  have  continuous  derivatives  of  as  high 
order  as  possible.  The  temporal  window  is  proportional  to 

,  -,4 

sin  ( ff  Lyy  f/ 4)1 

irI^f/4  J  (22) 

and  has  a  worst  sidelobe  of  -53. 1  dB.  (The  picket  fence  was  not  computed. ) 
However,  a  cubic  temporal  weighting  is  not  easily  accomplished  in  the  frequency 
domain. 
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When  (cosine)6  temporal  weighting  is  considered,  it  is  found  that  75  per¬ 
cent  overlap  forces  us  to  sample  the  temporal  window  on  the  skirts  of  its  main- 
lobe.  This  is  an  unacceptable  weighting  because  the  peak  sidelobe  in  the  vernier 
spectrum  increases  significantly. 

The  possibilities  of  Dolph-Chebyshev  weighting  are  indicated  in  figure  11. 
The  worst  sidelobe  is  -86  dB  in  figure  11B  for  fo  =  256  1/4  Hz,  and  the  picket 
fence  is  -2.  29  dB  in  figure  11C  for  fo  =  256  7/16  Hz.  This  is  a  32-dB  improve¬ 
ment  in  sidelobe  level  compared  with  (cosine)6  weighting.  The  picket  fence  is 
degraded  by  0.  22  dB. 

To  determine  the  effect  of  not  using  delay  weighting,  figure  12  was  computed 
for  (cosine)2  temporal  weighting  and  flat  delay  weighting.  For  certain  excitation 
frequencies,  a  very  narrow  mainlobe  is  realized;  see  figure  12A.  However,  for 
other  excitation  frequencies,  the  lack  of  delay  weighting  creates  broad  "shoulders" 
of  significant  level;  see  figure  12B.  Also,  the  sampling  in  f  and  v,  inherent 
in  the  FFT,  produces  a  picket  fence  of  -5.0  dB.  Thus,  although  a  low  peak 
sidelobe  is  attained,  lack  of  delay  weighting  is  very  detrimental  to  performance, 
as  will  be  further  demonstrated  below.  Notice  from  figure  7  that  (cosine)2  delay 
weighting  also  yields  a  peak  sidelobe  of  -41.  8  dB,  but  has  no  broad  shoulders 
and  has  a  picket  fence  of  only  -2.  60  dB. 

The  detrimental  effects  of  no  delay  weighting  are  best  illustrated  by  a  com¬ 
parison  of  figures  13  and  14,  which  have  two  tones  separated  by  1/2  Hz,  one 
15  dB  stronger  than  the  other.  It  is  seen  that  these  two  tones  are  resolved,  even 
though  they  are  closer  than  the  resolution  capability  of  the  individual  time  seg¬ 
ments,  that  is,  closer  than  1  Hz.  In  figure  13,  (cosine)2  delay  weighting  is 
employed;  in  figure  14,  none  is  employed.  A  comparison  of  part  A  of  these 
figures  reveals  that,  for  excitation  frequencies  256  and  256  1/2  Hz,  the  flat 
delay  weighting  is  better.  However,  for  excitation  frequencies  256  1/16  and 
256  9/16  Hz,  the  presence  of  the  weaker  tone  is  clearly  evident  in  figure  13B, 
but  hardly  discernible  in  figure  14B(no  delay  weighting).  The  presence  of  noise 
would  obscure  the  weaker  peak.  Thus,  although  the  peak  sidelobe  may  be  very 
small,  the  presence  of  high-level  broad  shoulders  must  also  be  eliminated  by 
use  of  delay  weighting. 


CONCLUSIONS 

An  approximate  and  quick  FFT  technique  for  vernier  spectral  analysis  is 
possible  by  employing  overlapped  temporal  weighting  and  delay  weighting.  For 
50  percent  overlap  and  Hanning  delay  weighting,  the  best  simply-applied  tem¬ 
poral  weighting  discovered  was  a  single  cosine  lobe,  realizing  a  peak  sidelobe 
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of  -23  dB.  However,  Dolph-Chebyshev  temporal  weighting  achieves  -33  dB  side- 
lobes.  For  75  percent  overlap  and  Hanning  delay  weighting,  the  best  simply  ap¬ 
plied  temporal  weighting  discovered  was  (cosine)5,  which  realized  apeak  sidelobe 
of  -54  dB.  However,  Dolph-Chebyshev  temporal  weighting  is  capable  of  -86  dB 
side  lobes.  Which  overlap  and  weighting  to  employ  depends  on  the  limitations  on 
computation  time  and  storage,  and  on  the  relative  strength  and  location  of  inter¬ 
fering  tones. 

The  overlap  is  not  limited  to  the  above  choices.  It  could,  for  example,  be 
67  percent.  The  best  weightings  were  not  investigated  in  this  case,  because  it 
was  felt  that  the  above  overlaps  were  easier  to  implement  in  most  cases  of 
practical  interest.  However,  Dolph-Chebyshev  weighting  is  always  a  strong 
candidate  and  is  quickly  and  accurately  generated  (reference  6). 
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246  256  266 

Frequency  (Hz) 


Figure  4A.  =  256  Hz 
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Frequency  (Hz) 

Figure  4B.  fQ  =  256  1/16  Hz 

Figure  •).  Vernier  Spectrum  for  Cosine  Temporal  Weighting,  50  percent  Overlap 


Relative  Power  (dBI  Relative  Power  (dB) 


TR  4767 


246  256  266 

Frequency  (Hz) 


Figure  4D.  fQ  =  256  3/16  Hz 

Figure  4  (Cont'd).  Vernier  Spectrum  for  Cosine  Temporal  Weighting, 

50  percent  Overlap 
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111 

II 


Hill 


Frequency  (Hz) 

Figure  4G.  (n  =  256  6/16  Hz 
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Frequency  (Hz) 

Figure  4H.  f^,  =  256  7/16  Hz 


Figure  4  (Cont'd).  Vernier  Spectrum  for  Cosine  Temporal  Weighting, 

50  percent  Overlap 


Relative  Power  (dB) 
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Frequency  (Hz) 

Figure  41.  fQ  =  256  1/2  Hz 

Figure  4  (Cont'd).  Vernier  Spectrum  for  Cosine  Temporal  Weighting, 

50  percent  Overlap 


20 


Relative  Power  (dB)  Relative  Power  IdB) 


TR  4767 


246  256  266 

Frequency  (Hz) 


246  256  266 

Frequency  (Hz) 


Figure  5B.  fQ  =  256  7/16  Hz 

Figure  5.  Vernier  Spectrum  for  (Cosine)"  Temporal  Weighting, 

50  percent  Overlap 
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Figure  6A.  f0  =  256  Hz 


6  256 

Frequency  (Hri 


Figure  6B.  fQ  -  256  1  H.* 


Figure  6.  Vernier  Spectrum  for  Dolph-('h<S\  •  n 
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Frequency  (Hz) 


Figure  6C.  f0  =  256  7/16  Hz 

Figure  6  (Cont'd).  Vernier  Spectrum  for  Dolph-Chebyshev  Temporal 
Weighting,  50  percent  Overlap 


Relative  Power  (dB) 


Figure  7.  Vernier  Spectrum  for  (Cosine)2  Temporal  Weighting,  75  percent  Overlap 
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Frequency  (Hz) 

Figure  7C.  £<>  =  256  1/2  Hz 

Figure  7  (Cont'd).  Vernier  Spectrum  for  (Cosine)2  Temporal  Weighting, 

75  percent  Overlap 
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Frequency  (Hz) 
Figure  8A.  *  256  Hz 
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Frequency  (Hz) 

Figure  8B.  f,,  =  256  1/8  Hz 

Figure  8.  Vernier  Spectrum  for  (Cosine)**  Temporal  Weighting,  75  percent  Overlap 
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Figure  8C.  tQ  =  256  7/16  Hz 

Figure  8  (Cont'd).  Vernier  Spectrum  for  (Cosine)3  Temporal  Weighting, 

75  percent  Overlap 
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Frequency  (Ha) 
Figure  9A.  fQ  *  256  Hz 
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Frequency  (Hz) 

Figure  9B.  fQ  =  256  7/16  Hz 

Figure  9.  Vernier  Spectrum  for  (Cosine)4  Temporal  Weighting,  75  percent  Overlap 
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Frequency  (Hz) 

Figure  9C.  fQ  =  256  1/2  Hz 

Figure  9  (Cont'd).  Vernier  Spectrum  for  (Cosine)4  Temporal  Weighting, 

75  percent  Overlap 
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Frequency  (Hz) 

Figure  10B.  fo  *  256  1/8  Hz 

Figure  10.  Vernier  Spectrum  for  (Cosine)®  Temporal  Weighting, 

75  percent  Overlap 
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Figure  IOC.  ^  =  256  7/16  Hz 

Figure  10  (Cont'd).  Vernier  Spectrum  for  (Cosine)5  Temporal  Weighting, 

75  percent  Overlap 
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Figure  11B.  f0  =  256  1/4  Hz 

Figure  11.  Vernier  Spectrum  for  Dolph-Chebyshev  Temporal  Weighting, 

75  percent  Overlap 
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Figure  11C.  ^  =  256  7/16  Hz 

Figure  11  (Cont'd).  Vernier  Spectrum  for  Dolph-Chebyshev  Temporal 
Weighting,  75  percent  Overlap 
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Figure  12A.  ^  =  256  1/2  Hz 
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Figure  12B.  fQ  =  256  7/16  Hz 
Figure  12.  Vernier  Spectrum  for  Flat  Delay  Wt-.ghting 
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Appendix  A 


no  +  N  -  1 

TWO  METHODS  OF  COMPUTING  ^  exp  (-i2#pn/N)  qn 


n  =  n. 


Define 


no+  N-l 

Qp  =  exp  (-i2irpn/N)qn,  0  <p  <  N  -  1  , 

n  =  no 

where  >  0.  If  we  let  m  =  n  -  rig,  (A-l)  becomes 

N-l 

Qp  =  exp  (-i2*pno/N)  ^  exp  (-i2trpm/N)  qm  + ^  . 

m  =  0 

,no  +  N  -  1 

The  sum  in  (A-2)  is  an  FFT  of  the  sequence  jqnfn^ 


(A-l) 


(A-2) 


For  an  alternative  method,  consider  the  general  term  qn  in  (A-l).  Then, 


if 


(a)  n  =  0,  N,  2N, ....  qn  gets  weight  exp  (-i0)  ; 

(b)  n  =  1,  N+  1,  2N+  1, . . . ,  qn  gets  weight  exp  (-i2»p/N)  ; 


(A-3) 


(c)n  =  N-l,  2N-1,  3N-1, ...»  qn  gets  weight  exp  (-i2»p(N-  1)/N) 
So,  if  we  define  n  =  n  mod  N,  then  case 

(a)  corresponds  to  n  =  0  ; 

(b)  corresponds  to  n  =  1  ; 


(A-4) 


Therefore,  let 


(c)  corresponds  to  n  =  N  -  1  . 


v.  =  qn,  n0<n<n0+N-l, 


(A -5) 


A-l 
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in  which  cane 


N  -  1 


v£. 


exp  (-i2»pn/N)  v.  ,  0<p<N*l; 


(A-6) 


this  is  simply  an  FFT  of  |vR  }Q  ,  with  no  phase  factor  necessary.  Equation 
(A-5)  corresponds  to  Ailing  up  the  array,  from  the  given  quantities  qn, 
starting  from  the  nonzero  position,  Hq  mod  N,  and  cycling  around  to  position  0. 

To  apply  these  results  to  (10),  suppose  weight  w  is  nonzero  for  t  >  0  . 
Then  if  ^  is  the  smallest  integer  such  that  iiq  >  k  S/A ,  (10)  can  be  expressed  as 


Oq  +  N-l 


/  p  \  * 

I - ,  ksj  =  A  2  exp(-i2#pn/N)  x(nA)  w(nA  -  kS) 

\NA  /  ", 


(A-7) 


n  -  n<j 


This  is  of  the  form  of  (A-l)  if  we  identify 


qn  =  x(nA)  w(nA  -  kS)  . 


(A-8) 
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Appendix  B 

DERIVATION  OF  VERNIER  SPECTRUM 
From  the  first  line  of  (7)  in  the  main  text,  there  follows  Immediately 

Y(f,  a)  *  A(f,  *)  •  D<*)  •  i^p)  ,  (B-l) 

where  all  convolutions  are  on  p,  for  f  fixed.  Then  using  (6),  we  obtain 

A(£,*)  *  j'  dr  exp  (-12 »  pr)  a(£,r) 

-  f  dr  exp  (-12*  p  r)  f  dt  exp  (—12 » ft)  x(t)  w(t  -  r )  (t)  (B-2) 

j  j  a 

*  W <-*)/  dt  exp  <-i2«(f  ~  *)  t)  x(t)  A^(t) 

-  w<-'>  Ex([* '  -  t)  • 

Substituting  (B-2)  in  (B-l),  we  have 
Y(f,  *)  *  |W(-*)£x| 

L  m 


f  -  *  - 


V* 


(B— 3) 
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Appendix  C 
SAMPLE  PROGRAM 


The  program  furnished  below  is  illustrative  of  the  vernier  technique.  It 
has  been  written  for 


1  1 

A  (DEL)  = - seconds,  N  =  1024,  M  =  32,  S  *  —  sec 

1024  4 

fQ(F0)  =  256(1/16)  256-i  , 


(C-l) 


but  could  be  easily  changed  to  other  cases.  The  input  data  are  furnished  by  inter¬ 
nal  functions  XREAL  and  XIMAG;  currently,  two  tones  of  relative  strength 
-15  dB  and  separation  1/2  Hz  are  incorporated.  Loop  1  in  the  main  program 
accomplishes  Hanning  temporal  weighting,  while  loop  2  accomplishes  Hanning 
delay  weighting.  The  subroutines  MKLFFT  and  QTRCOS  are  detailed  in  refer¬ 
ence  7.  The  method  in  this  program  employed  the  cycling  technique  described 
in  appendix  A. 


P*HA«ET£R  NslCUu  *  M=32 ,  *4sN/4*l  ,  M4sM/4*l 
INTESEK  PS 

DIMENSION  2R(N),ZI(N)«a(N),D(M)»AR(2l*M)«Al(2l,M) , ADR(M) »ADI (M) , 
SDQ(M)  ,CN(N4) ,CM(M4) ,2(200)  tX(leO)  *T  (160) 

SQSl0,**(-.75) 

TplMs|. *3, 1415*2654/ IM^l) 

TPlNst,*3,l4l592654/N 

MisM-1 

nisn-1 

DElsI./N 
ISSN/ 4 
SslS'DEL 

lMSINTaO®tFi.D*T(M)  )*1, 44274, 5) 

Caul  8TRC0S(CN#4) 

CauU  OTRCOS(CMm) 

Caul  NODES® (Z,0) 

Call  sjsje®( z, 5. , -100. , i65, , o. ) 

Call  06JCT8 (2,030., 1400. » 2425., 2350.) 

DO  1  M»s0,Nl 

1  ■tMS»l)sl.-cOSiTPlN*MS> 

Do  2  KSsO, Ml 
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2  D(AS*1>31.-C0S(tPXM*(KS«1)) 

DO  14  lCsl'16* 

13  X(1C)«1C 

00  12  IFsO'8 

F0=256.*ir«.0625 

Do  3  KSsO.Nl 

N031S9AS 

Nu=NO*Nl 

DO  9  NSsNO  *  Ml* 

NTSMOO(NSfM) 

ZR  l  NT*1 )  sXRE AL  (  nS*  jEL  )  •«  ( NS-NO+ 1 ) 
ZIINT*1)3XI*u«InS*uEL)**(NS-N0*1) 
i*  CONTINUE 

CALL  MKLFFT<ZR»Zl#wN»10*-l) 

00  s  P*s2<»6'266 

AR (PS-295# KS*l)s  OtL*ZR(PS»l) 

AHPS-295»KS*l>s  OtL*Zl(PS>l) 

5  CONTINUE 

3  CONTINUE 

00  6  PSs2R(>,266 
Do  7  K&sOfNl 

AQR (<$♦!) sS*AR (pS-295, KS*1)*0<kS*1) 

A02  ( AS^l )  sS«Al  (  pS-295 ,  <S*1 )  *0  ( *S*1 ) 

7  continue 

Call  nklffT(aoh.aox,cm,im.-i) 

XFlIF,E8.0.AN0.pS.t0.256)  PEAKsi 0 • *L00l 0 ( AOR ( 1 ) **2» AOl ( 1 ) »*2 ) 
00  6  XSsO'Ml 

AR(PS-295,*Stl)-A0rt(KS*l) 

A I (PS-295#  <S+1):A3I(KS*1) 

6  CONTINUE 

ft  CONTINUE 

DO  9  PSs296,26e 
IC*( PS-296) «6 
DO  10  MSsO»Mi 

A#AR  (PS-295,  MSM  J*»2^AI(PS-2*»5.NS4lJ«*2 
4»NAX(A*l.E-36) 

DB(*S*1 ) sio . PLOslO ( A  > -PEAK 
10  CONTINUE 

PRINT  iH.  PS 
19  FORNAT(///UO/) 

PRINT  il»  OB 

U  F0RmAT(/BC15.6) 

Y(ICU)308(N-3> 

T ( IC+2) 30B(m-2) 

T ( 1C+3) sOB (M-l) 

Y(1C*9)30B(M) 

Y { IC+5) s06( l ) 

Y(lC*t)sOS(2) 

Y<1C*T)sOB(3) 

Y ( 1C+I) 306(A) 

9  CONTINUE 

SIUE3-200. 

00  17  lCsl»16« 

lF(lCf6E.69,ANO.IC.LC.109)  BO  TO  17 
SIWE3NAX(SI0E»Y(IC) ) 
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17  CONTINUE 

MINT  It'  SI9C 

it  FOftNATC/'  PEAK  SlDtlOtfc  XS'.ElA.t) 

Caul  S£TSM6(2'AO'l.) 

00  is  XF*tO,tO»lO 

CAlL  LIN£St(Z*0,  S. '-FLOAT ( IF) > 

IS  Call  LINESt(Z»i. ItS* '-FLOAT ( IF) ) 

00  It  lF*U'l57.a 

Call  LXNCSt(Z'O.^LUAT(lF)'0.) 
it  CALL  LXNeSt(Z'l.FLOAT(lF)'-iOO.) 

CALL  S£TS*S(Z.Sn'2*> 

CALL  LXN£S»(Z'0.S.'-100.) 

Call  LXNCSt(Z' X'S' *0' ) 
call  LiN£St(Z*i.iti>.»o.) 

Call  LXN£Si(2' 1 , its* ' -100' ) 

Call  LINCSStZ'l.S. '-100*1 
Call  Ll N£SS ( Z » 1 6®  *  *  *  T ) 

Call  FAtct(Z'O'S'i) 

12  CONTINUE 

call  Caito(z) 

faint  11'  FCAK 

function  xaeal(t) 

XAEALsCOS ( 2 . . 1 *la«2tS»*F0*T ) 

S  ♦COS  <  2 . •! . l  mstttst*  ( F0»  .  5 )  »T )  •*§ 
AeIUAN 

Function  xina*u> 

XINA»t*IN<2.*'S.i»l3*tt*%«F0«T) 

%  ♦SlN(2.*d.itla*2tSt*<F0A,»)*T)»St 

Ac TUAN 
EnU 


C-3/C-4 
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Appendix  D 

EFFECT  OF  (COSINE)n  TIME  WEIGHTING 

Let  us  define  the  spectrum  of  unweighted  data  x  as 

N  - 1 

Z(f)  =  f  dt  exp  (-i2#ft)  x(t)  A«A(t)  =  4  V  exp  (-i2*kAf)  x(kA),  (D-l) 

k  =  0 

and  that  corresponding  to  weighting  w  as 

Zw(f)  =  J dt  exp  (—12 » ft)  x(t)  w(t)  A5A(t)  =  Z(f)  •  W(f) 

(D-2) 

=  A  2^  exp  (-i2»kAf)  x(kA)  w(kA)  . 
k  =  0 

Now  for  (cosine)11  time  weighting,  we  have* 

w(kA)  =  sin11  (k*/N),  0  <  k  <  N  -  1  .  (D-3) 

Substituting  (D-3)  in  (D-2),  there  follows 
N  - 1 

Zw(f)  =  A  ^  exp  (-12rkAf)  x(k  A) 
k  =  0 


the  last  step  via  use  of  (D-l).  This  result  yields  (20). 


&[ 


exp  (ik»/N)  -  exp  ( 


-iktr/N) 


*See  the  first  footnote  to  equation  (19)  of  the  main  text  for  the  explanation 
of  sinn  in  (D-3). 
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For  the  special  case  of  n  =  1,  (D-4)  becomes 

z^-11/2[z(,~)-z(f*i)]- 


and  in  particular. 


(D-5) 


(D-6) 


The  right-hand  side  of  (D-6)  involves  two  adjacent  spectral  values  as  afforded 
by  the  standard  N-point  FFT  in  (D-l).  The  left-hand  side  of  (D-6)  is  the  spec¬ 
tral  value  of  Zw  at  the  frequency  halfway  between  the  above  two  spectral  loca¬ 
tions.  Thus,  under  this  interpretation  of  the  right-hand  side  of  (D-6),  the  desir¬ 
able  sidelobe  control  predicted  by  the  convolution  in  (D-2)  can  be  attained.  A 
similar  interpretation  is  possible  for  (D-4)  for  general  n. 
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APPROXIMATIONS  FOR  STATISTICS  OF 
COHERENCE  ESTIMATORS 

INTRODUCTION 


Expressions  for  the  probability  density  function,  the  cumulative  distribution 
function,  and  any  moment  of  the  estimates  of  magnitude-squared  coherence 
(MSC)  and  magnitude  coherence  (MC)  are  available  in  references  1-5.  The 
expressions  for  the  moments  usually  involve  a  generalized  hypergeometric 
function*  3F2  and  require  a  time-consuming  computer  effort  for  their  evalu¬ 
ation.  Also,  the  fundamental  dependence  of  statistics  like  the  bias,  variance,  and 
mean-square  error  on  the  number  of  averages  N  and  the  true  coherence  are 
not  obvious,  because  of  the  lack  of  significant  results  for  the  3F2  function. 

This  report  will  seek  to  present  accurate  approximations  for  these  statis¬ 
tics,  of  as  simple  a  nature  as  possible,  and  capable  of  hand  calculation.  Also, 
the  dependence  on  N  and  on  the  true  coherence  will  be  deduced,  and  thereby 
future  experiments  can  be  designed  in  which  the  required  stability  can  be  pre¬ 
dicted  and  attained  with  ease  and  certainty.  As  a  by-product,  a  technique  for 
reducing  a  particular  type  of  3F2  function  to  a  Gauss  hypergeometric  function 
(reference  7,  chapter  15)  is  presented. 

ESTIMATION  OF  MAGNITUDE-SQUARED  COHERENCE 


The  complex  coherence  between  two  jointly  stationary  random  processes 
x(t)  and  y(t)  is  defined  as 


Gxy® 

1/2  ’  M 

xx®  Gyy®] 

where  Gxy(f)  is  the  cross-spectral  density  at  frequency  f,  and  G^f)  and 
Gyy(f)  are  the  auto-spectral  densities.  The  MSC  is 

C(f)  =  |7xy(f)|2  .  (2) 


•See,  for  example,  reference  6,  section  9. 14. 
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The  MSC  is  frequently  estimated  according  to 


Gyy(^) 


N 


E  xn<*> 

n=l 


E  k<e|2  Ek^l2 

r>=1  n=  1 


(3) 


where  N  is  the  number  of  data  segments  employed,  and  Xn(f),  Yn(f)  are  the 
(discrete)  Fourier  transforms  of  the  n-th  weighted  data  segments  of  x(t)  and  y(t). 


GENERAL  RELATIONS 


The  m-th  moment  of  the  random  variable*  C  for  independent  data  segments 
is  given  in  reference  1,  (4)  and  reference  2,  (3)  as 

E  |cm|  2  ■"  i1  -  C>N  3F2  (m+1>  N»  N;  N  +  m,  1;  C),  (4) 

where  C  is  the  true  MSC  and  3F2  is  a  generalized  hypergeometric  function. 
The  power  m  need  not  be  integer  in  (4). 

For  m  =  1,  the  first  moment  of  C  can  be  reduced  (reference  5,  appendix  B) 
to  the  simpler  (and  rapidly  convergent)  form 

“1’ l4"lric  F(1, 1;N*2iC)  ■  <5> 

where  F  is  the  Gauss  hypergeometric  function.  For  m  =  2,  the  second  moment 
of  C  can  be  reduced  to  the  simpler  form  (see  appendix  A) 

N3  -  2N2  +  2N  -  2  N  -  1  r  „ 

„  - - + - In2  -  (N  -  2)  Cj  F (1,  1;  N  +  2;  C)  ,  (6) 

2  N  N  +  1  L  J 

which  involves  the  F  function  with  the  same  arguments  as  in  (5).  Equations  (5) 
and  (6)  give  exact  results  from  which  the  bias,  variance,  and  mean-square  error 
of  the  MSC  estimate  C  can  be  obtained. 


•The  f-dependence  is  suppressed  for  convenience. 
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BIAS  APPROXIMATION 
The  bias  of  C  is 

Bias(C)  =  EjcIc.N}  -  C  =  -  C  .  (7) 

By  expanding  F  in  (5)  in  a  power  series  in  C  and  retaining  terms  to  order  N“2, 
we  obtain  the  approximation 

Bias(C)  (1-  C)2  (l+^Si  •  (8) 

Plots  of  (7)  and  (8)  are  given  in  figure  1  for  N  =  8  and  16.  The  discrepancy  be¬ 
tween  the  exact  result  (7)  and  the  approximation  (8)  is  barely  discernible  for 
N  =  8  and  is  not  discernible  for  N  =  16.  The  discrepancy  (between  (7)  and  (8)) 
is  even  less  for  larger  N.  Equation  (8)  is  a  much  simpler  and  more  accurate 
approximation  than  reference  2,  (5).  The  bias  and  approximation  are  observed 
to  have  a  peak  of  value  l/N  at  the  origin  and  to  decrease  monotonically  with 
the  value  C  of  the  true  MSC. 


1/N 


BIAS  <C) 


Figure  LA.  N  =  8 
Figure  1.  Bias  of  MSC  Estimate 


3 


TR  5291 


1/M 


BIAS  <£) 


I 

I 


^WACT 

APPROXIMATION'''*' 


Figure  IB.  N  *  16 

Figure  1  (Coat'd),  Bias  of  MSC  Estimate 


VARIANCE  APPROXIMATION 

An  expansion  for  the  variance  of  C  is  given  in  reference  2,  (6).  Lf  we 
expand  the  bracketed  term  to  order  N~l,  we  obtain  the  approximation 


v«,.^a-c,2r2C.-ic^3cf] 

This  result  can  also  be  obtained  from  the  exact  expression 

Variance(C)  =  j*2  " 


(9) 


(10> 


combined  with  (5)  and  (6). 

Plots  of  (9)  and  (10)  are  given  in  figure  2  for  N  =  8  and  16.  The  discrepancy 
between  (9)  and  (10)  is  barely  discernible  for  N  =  16  and  is  not  discernible  for 
N  _>  32.  Equation  (9)  is  a  much  simpler  and  better  approximation  than  refer¬ 
ence  2,  (6). 

For  large  N,  the  peak  of  the  variance  occurs  at  C  S  1/3  and  is  of  value 
9/27  N"1.  Thus,  even  when  the  true  coherence  is  unknown,  the  maximum  vari¬ 
ance  will  be  less  than  0.3/N,  for  large  N. 
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.3/M 


|  ^APPROXIMATION 

l 

iXACT'*' 

\ 

IAMCS  <C) 


Figure  2A.  N  *  8 


3/N 


tXACT 

V 


APPROXIMATION 

IANCC  < C ) 


I 


0  - - 

0  c 

Figure  2B.  N  =  16 
Figure  2.  Variance  of  M SC  Estimate 
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MEAN-SQUARE  ERROR  APPROXIMATION 

The  mean-square  error  of  the  MSC  estimate  C  is 

Mean-Square  Error  (C)  -  E  |(C  -  C)2| 

*  [Bias  (C)]2  +  Variance  (C) 

*  (Mx  -  C)2  +  (a2  -  a2)  =  s2  ~  2 C +  C2  .  (11) 

This  exact  result  can  be  computed  by  means  of  (5)  and  (6).  If  we  substitute 
approximations  (8)  and  (9)  in  (11)  and  retain  terms  of  the  two  highest  orders  in 
N,  we  obtain 


Mean-Square  Error  (C)  s 


N  ♦  1 


(1  -  C)' 


„  1  -  5C  ^  7C2 

Lc - n — j  * 


ii2l 


Plots  of  (11)  and  (12)  are  presented  in  figure  3  for  N  =  8  and  16.  The  dis¬ 
crepancy  between  (11)  and  (12)  is  discernible  for  N  =  16  but  cannot  be  seen  for 
N  _32. 

For  large  N,  the  peak  of  the  mean-square  error  occurs  at  C  I  13  and  is 
of  value  8  27  N'1. 


^approximation 


iXACT^ 


MEAN  - 
SOUANC 
ERROR  (C) 


0  ' 
0 


Figure  3  A.  N  -  8 
Figure  3.  Mean-Square  Error  of 
MSC  Estimate 
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a/N 

t*ACT  „ 

\ 

\ 

\ 

\ 

APPROXIMATION 


MC  AN  ■ 

souarc 

ERROR  (£) 


Figure  3B.  N  =  16 

Figure  3  (Cont'dt.  Mean-Square 
Error  of  MSC  Estimate 

ESTIMATION  OF  MAGNITUDE  COHERENCE 

The  magnitude  coherence  (MC  t  is  defined  as 

S(f)=|Vf)|=vc(fr  (i3» 

upon  use  of  (2t.  The  estimate  of  MC  is 

S(f)  =  s/c(fi  ,  (14) 


where  C(f)  is  defined  in  (3). 

GENERAL  RELATIONS 

The  first  moment*  of  S  is  available  from  (4)  by  setting  m  =  1/2: 


e{s{ 


r(N)r(3/2),,  «.2n 

'  T(N  *  1  2.  (1  "  6  ’ 


l  F2  (■§"• 


N,  N;  N 


1/2,  1;  S2)  . 


(15) 


•The  f-dependence  ;s  suppressed  for  convenience. 
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The  second  moment  of  §  la  directly  available  from  (5): 

EjS2}  -  ~  ’  s2  F{U  1;  N*2;  s2)  *  (16> 

It  will  be  noticed  that  (15)  and  (16)  are  even  functions  of  S;  this  information  will 
be  useful  in  the  approximate  forme  to  be  adopted  later.  Equations  <  15  >  and  (16) 
give  exact  results  from  which  the  bias,  variance,  and  mean-square  error  of  the 
MC  estimate  S  can  be  obtained. 


A  significant  difference  now  exists  between  treatment  of  the  MC  estimate 
and  the  MSC  estimate:  whereas  (4i  could  be  reduced  to  an  F  function  for  m  an 
integer,  no  such  reduction  has  been  discovered  for  (15t.  Further,  <15i  is  not  an 
appealing  analytic  result,  as  may  be  anticipated  by  noticing  that,  since  <15>  must 
equal  unity  at  S  =  1,  and  the  leading  factor  contains  an  N-th  order  £ero  at  S  =  1, 
then  3F2(.  .  . )  must  contain  an  N-th  order  pole  at  S  -  1.  No  transformations 
or  useful  approximations  of  the  3F2  function  in  (15)  were  discovered  in  refer¬ 
ences  6-11. 

EXPANSIONS  ABOUT  S  =  0 

A  direct  series  expansion  of  (4)  yields 

E{§2m(  s  Wri”  — —  ^  1  *■  m  ^  — —  5- 

T(N  *•  m)  I  N  ♦  m 


- [(N2-  N)(m-  It  *  2<m  ♦  li]  S4 

♦  m  (N  ♦  1  ♦  m)  L  J 


4(N  *  m)(N 


.  a:  1 


Now,  if  m  =  1,  the  N2  and  N  terms  in  the  S4  term  drop  out,  and  we  get  a 
useful  development  in  which  the  terms  decay  with  N: 


El§2J 


I  N~  1  c2 
N  "n*1 


N  -  1 

(N  *■  1)(N  ♦  2) 


But,  if  m  =  1/2,  we  obtain 


c-fSl  ,  T(N)r(3/2)  )  N(N-l)  c2  N(N  -  1)(N2  -  N  -  6)  ,4  ( 

'  '  r'(N  +  l/2)  r  2N  *  1  4(2N  ♦  1)(2N  ♦  3)  '  '  ‘  i 


(Is ) 


(19) 


and  the  coefficients  of  S2,  S4, .  .  .  increase  with  N,  in  direct  contrast  to  re¬ 
sults  for  MSC  estimation.  This  increase  is  due  to  the  two  numerator  terms  and 
one  denominator  term  in  3F2  in  (15)  that  depend  on  N. 
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EXPANSIONS  ABOUT  S  »  1 

If  the  results  In  (5)  and  (6)  are  expanded  about  S  *  C  «  1  by  means  of  ref¬ 
erence  7,  equation  15. 3. 11,  we  find  the  asymptotic  expansions 


E|C}»c  ^-j3T2  a-o 


2! 


(N-2)(N-3) 


(1-C)3 


3! 


(N -  2)(N  -  3)(N  -4) 


(1-C)*  ♦  .  .  . 


(20) 


and 


Eie2}  - 1  -  2(i- C)  --^rf-a-c)2 


4(N  +  1)  (1-C)3  + 


6(3N  +  2) 


(N  -  2)(N  -  3) 


(N  -  2)(N  -  3)(N 


(1  - C)  +  .  .  . 


(21) 


C2'NT2(1-C2>2- 


(N  -  2)(N  -  3) 


2  3 

(1-C2) 


6  f 


N-N2 


16 


(N  -  2)(N  -  3)(N  -  4) 


2  ^ 

(1-C*)  +  .  .  . 


(22) 


upon  regrouping  terms.  Expressions  (20)  and  (22)  are  useful  near  C  *  1  and 
Indicate  bow  rapidly  E |Cmf  -  Cm  approach  zero  as  C  approaches  one,  for 
m  *  1  and  2.  It  will  be  observed  from  (20)  and  (22)  that  the  coefficients  of 
(1  -  Cm)2  and  (1  -  Cm)3  are  identical,  and  those  of  (1  -  Cm)4  are  similar. 

It  was  thought  that  E{§|  =  e|£1/2}  might  possess  a  similar  expansion  in 
powers  of  (1  -  C^2)  =  (1-S)  and  provide  a  useful  method  of  evaluating  (15),  at 
least  near  S  *  1.  In  appendix  B,  It  is  Indeed  shown  (after  considerable  labor)  that 


1  N2  -  7N  ♦  16 
16  (N  -  2)(N  -  3) 


(1-C)3  ♦  .  .  . 


(23) 


S  ♦ 


N-2 


(i-sr  - 


(N-2)(N-3) 


<1-  S)J  ♦ 


(24) 
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* 


(upon  regrouping  terms),  which  has  the  identical  coefficients  as  (20)  and  (22), 
up  through  the  order  computed.  Equation  (24)  shows  that  the  bias  of  the  MC 
estimate  §  approaches  zero  as  S  approaches  one  according  to  (l-S)2/(N-2). 
Also,  (24)  and  (20)  can  be  combined  to  show  that 


-  (l-S2) 

Variance  (S)  = 


as  S  -*  1 


(25) 


This  corroborates  reference  4,  (8). 

CHOICE  OF  APPROXIMATION 

Expansions  like  (20)- (25)  cannot  be  used  to  evaluate  the  desired  statistics 
for  small  S;  in  fact,  they  are  divergent  asymptotic  expansions.  When  this  in¬ 
formation  is  combined  with  the  earlier  results  about  S  -  0,  we  find  that  direct 
analytic  expansions  of  (15)  are  not  fruitful,  in  contrast  with  the  earlier  approach 
for  MSC  results.  Instead,  we  must  adopt  some  convenient  simple  approximation 
and  try  to  match  It  to  the  exact  results  in  some  fashion.  (The  techniques  in  ref¬ 
erence  12,  chapter  9,  are  relevant  in  this  regard. ) 

Before  we  do  that,  however,  it  is  necessary  to  digress.  We  know  that 

Bias  (S)  =  Ejs}  -  S  ,  (26) 

Variance  (S)  =  E{§2}  -  E2{§{  ,  (27) 

Mean-Square  Error  (S)  =  [Bias  (§)]  2  +  Variance  (S)  ,  (28) 

where  the  exact  moments  are  given  in  (15)  and  (16).  A  very  good  approximation 
to  E{§2}  =  E  jc}  is  already  available  from  (7)  and  (8),  namely, 

EjchC+i(l-C)2(l+f)  ,  (29) 

or 

Ej§2}  S  S2  +-^-(l-S2)2  (l  +-2|-)  .  (30) 

Therefore,  if  we  can  approximate  Ejs}  or  Bias  (S)  or  Variance  (S)  in  (26) 
and  (27»,  we  will  have  approximations  for  all  three  statistics  in  (26»-(28). 
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Initial  attempts  concentrated  on  approximating  the  bias  (26)  by  the  form 

2 

(ri~_S2  *  >>3  ,  (31) 

where  a  and  v  were  chosen  so  as  to  match  the  exact  bias  and  its  derivative  at 
S  =  0;  these  attempts  were  not  successful  for  all  N  and  S.  A  generalization  to 
the  form 


ii^Ml-S)'[a*JlS2(S-X>],  X*  ->i  ,  (32, 

was  quite  good  for  N  up  to  100,  but  deteriorated  for  larger  N,  despite  also 
matching  the  exact  second  derivative  of  the  bias  at  the  origin.  Numerous  other 
forms  were  tried  for  approximating  the  bias  but  yielded  poorer  approximations. 

VARIANCE  APPROXIMATION 


Succeeding  attempts  were  aimed  at  approximating  the  variance  (27).  It  will 
be  recalled  (from  the  discussion  under  (16))  that  (27)  is  an  even  function  of  S. 
(This  even  property  is  not  true  of  (26)  or  (28),  because  of  the  S  term  in  (26). ) 
The  approximation  to  the  variance  was  therefore  also  chosen  to  be  even;*  after 
much  trial  and  error,  an  acceptable  form  was  found  to  be 


2  2 
n 

Variance  (S)  =  '  ' 


2(N  -  2) 


1-|(1-sVa 


(1-S2)5 


-  „2 


=  o 


app 


(33) 


The  leading  term  in  (33)  is  dictated  by  (25);  the  second  term  in  the  bracket  was 
deduced  from  observing  the  numerical  values  of  the  variance  near  S  =  1;  and 
the  numerator  of  the  third  term  is  chosen  to  make  it  decay  faster  than  the  other 
two  terms  near  S  =  1.  Equation  (33)  already  matches  the  value  and  derivative 
of  the  exact  variance  at  S  =  1,  and  the  three  constants  were  chosen  so  as  to 
match  the  value  and  first  four  derivatives  of  the  exact  variance  at  S  =  0;  see 
appendix  C.  The  end  result  of  the  investigation  is  that  the  constants  are  given  by 


A  =  -0.571 


1.75  0.760 

N  +  N2 


B  =  0.752  N  -  3.26 

D  =  0.221  N2  -  1.66  N  .  (34) 

•See  reference  12,  pages  108  and  118. 
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Plots  of  the  exact  variance  (27)  and  the  approximate  variance  (33)  are  pre¬ 
sented  in  figure  4  for  N  =  8,  16,  64,  256,  and  1024.  (Notice  that  the  abscissa 
is  S,  not  C. )  The  discrepancy  does  not  go  to  zero  as  N  increases,  as  it  did 
for  the  MSC  approximation;  however,  the  discrepancy  is  small  over  the  practical 
range  of  values  of  N  (i.  e. ,  N  <  1000),  where  N  is  the  number  of  averages 
employed  in  the  MC  estimate. 

The  peak  of  Variance  (S)  occurs  at 

s  {l  -  W *§)  £or  64  N i 1024  <35) 

and  is  of  value 

-  0.49  10  290 

Peak  Variance  (S)  S — - -  + — —  for  64  <  N  <  1024  .  (36) 

N  N2  N3  ~  " 

These  results  follow  by  fitting  the  exact  numerical  results  in  figure  4.  For  very 
large  N,  (36)  suggests  that  the  peak  variance  approaches  (2N)”1. 


Figure  4A.  N  =  8 

Figure  4.  Variance  of  MC  Estimate 
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Figure  4D.  N  =  256 


o  s 

Figure  4E.  N  =  1024 


Figure  4  (Cont'd).  Variance  of 
MC  Estimate 
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At  the  origin,  we  have,  from  (15)  and  (16), 


Variance  (S)  = 


_1_  rr(N)r(3/2)~ 

N  “  [r(N  +  l/2)_ 


2 


~  (\  _L  —  _L  °-215  Q.  196 
=  \  "  4 )  N  ■  16  n2  -  N  -  n2  • 


(37) 


Here,  we  have  employed  the  approximation  (reference  7,  equation  6. 1. 47) 


r(N)r(3/2)  ,  n/T/2  /  l\ 

r(N  +  l/2)  =  v/N  V  8N/ 


(38) 


which  is  excellent  even  for  N  as  small  as  2. 

BIAS  APPROXIMATION 

If  we  eliminate  e|§}  from  (26)  and  (27),  and  then  employ  (30)  and  (33), 
we  can  express 


Bias  (§)  =  [e{§2}  -  Variance  (§)J 1//2  -  S 


This  approach  is  in  line  with  the  observation  made  under  (30).  The  approximate 
variance  ®fpp  in  (39)  is  given  by  (33)  and  (34). 

Plots  of  the  exact  bias  (26)  and  the  approximate  bias  (39)  are  presented  in 
figure  5  for  N  =  8  and  16.  The  exact  bias  decreases  monotonically  with  S  and 
has  an  origin  value  of 


Blasts ,  (40) 

from  (15);  an  excellent  approximation  to  is  given  in  (38).  The  discrepancy 
between  (26)  and  (39)  is  barely  discernible  for  N  =  8  and  is  not  discernible  for 
N  =  16  up  through  N  =  1024. 
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Figure  5 A.  N  =  8 


Figure  5B.  N  *  16 
Figure  5.  Bias  of  MC  Estimate 
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MEAN-SQUARE  ERROR  APPROXIMATION 

The  approximation  to  the  mean-square  error  is  immediately  available 
via  (28): 


Mean-Square  Error  (S)  2  b|pp  +  e|pp  , 


(41) 


where  the  approximate  bias  and  variance  are  given  by  (39)  and  (33),  respectively. 
Plots  of  (28)  and  (41)  are  presented  in  figure  6  for  N  =  8,  16,  64,  256,  and  1024. 
The  discrepancy  does  not  go  to  zero  as  N  increases;  however,  it  is  small  over 
the  range  of  practically  useful  values  of  N. 

The  peak  value  of  the  mean-square  error  occurs  at  S  =  0  and  is  of  value 
1/N,  as  is  seen  from  (16).  The  mean-square  error  curve  is  composed  of  two 
distinct  regions,  one  near  the  origin  where  the  bias  dominates,  and  one  for 
larger  S  where  the  variance  dominates;  this  explains  the  hump  in  the  curves 
for  larger  N. 


Figure  6 A.  N  =  8 

Figure  6.  Mean-Square  Error  of  MC  Estimate 
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Figure  6D.  N  =  256 


Figure  6  (Cont'd).  Mean-Square 
Error  of  MC  Estimate 
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SUMMARY 


Approximations  for  the  MSC  estimate  are  given  by 

Bias  <C)  (1-C)2 

Vartttce(e)3J^a-C,2[2C.i-^] 

Mean-Square  Error  (C)  2  -  2  (1-C)2  C  +  1  ~  5CJ"—C  1 

N  +  1  L  N  J 


Approximations  for  the  MC  estimate  are  given  by 


Variance  <§)  1  - 1.  (1-S*)  +  A  ■  ^  =  -tpp  ,  (33, 


where 


„  1.75  0.760 

A  =  -0.571  +  —  +  — 


B  =  0.752  N  -  3.  26 


C  =  0.221  N2  -  1.66  N 


&«[s2+-±-<i-s2>2  (i  -  «ipp] 


■  S  5  ba 


*  2  o 

Mean-Square  Error  (S)  2  b|pp  +  <r£pp  . 
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All  of  these  are  capable  of  hand  calculation  over  the  entire  range  of  true 
coherence.  The  approximations  for  the  MSC  estimate  are  particularly  simple; 
those  for  the  MC  estimate  are  somewhat  more  complicated,  but  far  more 
tractable  than  the  exact  answers  involving  a  3F2  function.  The  fundamental 
dependencies  of  the  statistics  on  N  and  true  coherence  have  also  been  deduced. 
Although  the  discrepancies  between  approximations  and  exact  values  do  not  tend 
to  zero  for  the  MC  statistics  for  large  N,  the  approximations  are  useful  at 
least  over  the  range  from  N  =  8  to  N  *  1024,  which  is  believed  to  encompass 
the  region  of  most  practical  interest.  How  good  the  approximations  are  for 
larger  N  has  not  been  investigated  quantitatively. 
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Appendix  A 

REDUCTION  OF  THE  3F2  FUNCTION 
From  (4)  in  the  main  text,  we  have 

M2“N(irri)(1“C)N3F2(3.  N,  N;N  +  2,  1;  C)  , 


(A-l) 


which  is  very  slowly  convergent  for  C  near  1.  Now,  in  (A-l),  using  reference  6, 
section  9. 14,  we  have 


But 


^  <3)k  <N>k  ck 

3F2(""'  £*0  U>k(N*2>„  Tr(N’k 


(3,k  (k*l)(k  +  2) 

<Dk  ’  2 


and 


(N)k 

(N +  2)k  (N  +  k)(N  +  k+l)  • 


N(N+1) 


Substituting  (A-3)  and  (A-4)  in  (A-2)  and  (A-l)  yields 


(A— 2 ) 


(A-3) 


(A-4) 


M  =(1-C)N  Y  (k^1)(k^2)  C  gu 
2  (i  (k  +  N)(k  +  N  + 1)  k!  (N)k  * 

Now,  a  partial-fraction  expansion  yields 

(k+  l)(k  +  2)  _  (N  -  1)(N  -  2)  N(N  -  1) 

(k  +  N)(k  +  N  +  1)  +  k  +  N  ”  k  +  N  +  1  ; 

and,  since  we  can  express 

1  i  <N>k 


k  +  N  N  (N  +  l)k  ' 
1  1  (N  +  1)k 


k  +  N  +  1  N  +  l  (N  +  2)k  ’ 


(A-5) 


(A-6) 


(A-7) 


A-l 
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(A-5)  takes  the  form 


q-c)”lf:  £i(N)t  T  <Nlk.1-k-  £i 

H  jfco  k!  k  N  i£o  (N  +  1,k  k! 

N(N-l)  ^  <N)k(N>l)k  ck 

'  N-l  fa  (N +  2)k  k! 


(A-8) 


=  (l-C)^  F(n,  b;  b;  C) 


(N  -  1)(N  -  2) 


F(N,  N;  N  1;  C) 


N(N-l) 


F (N,  N  +  l;  N>2;  C)  , 


(A- 9) 


upon  employment  of  reference  7,  equation  15. 1. 1.  By  use  of  reference  7,  equa¬ 
tion  15.  3.  3,  this  can  be  manipulated  into  the  form 

„2„  !-f  (N-1MN-2)- (1-C)  F(l,  1;N  +  1;C) 


N(N  -  1) 


(1-  C)  F (2,  1;  N  +  2;  C)  . 


(A- 10) 


which  is  particularly  good  for  developing  in  a  series  in  (1-Ci  by  use  of  refer¬ 
ence  7,  equation  15.  3, 11. 

At  this  point,  a  multitude  of  alternative  forms  for  (A-10)  are  available  by 
use  of  reference  7,  page  558.  Several  rapidly  convergent  forms  involving  a 
single  F  function  are  now  listed: 

a 2  =  -  n3  ~  2n2  *.2P."  2  +4-T  [n2  -  (N  -  2)  cl  F(l,  1;  N  +  2;  C)  (A-ll) 

•  N  N  +  l  L  J 

2 _  (N  -  1)(N  -  2)  r 


N(N  +  1)  N  +  l 


=  1  + 


ii^ii  [N2-(N-2,C]CF(2,  1;  N  +  3;  C)  (A-12. 

*  ,1-C,  j(NH)(N-2)  -  [n2  -  (N-2|  Cj  F(2,  lj  N*2;  C)J  .  (A-13) 


A- 2 
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The  form  in  (A-ll)  uses  exactly  the  same  F  function  as  encountered  in 
and  is  more  rapidly  convergent  than  the  latter  two  forms,  for  all  values  of 

The  reduction  technique  employed  above  for  m  =  2  in  (4)  can  also  be  used 
for  other  integer  values  of  m.  However,  it  fails  for  m  noninteger,  because 
simplifications  like  (A-3)  and  (A-4)  do  not  occur  then. 


A-3/A-4 
REVERSE  BLANK 
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Appendix  B 

EXPANSION  ABOUT  S  =  1  FOR 
MAGNITUDE  COHERENCE 

The  estimate  of  MSC  is  given  in  (3).  We  let 

Xn(f)  =  +  iSq)  - 

Yn(f)  =  g(an  +  ibn)  +  (cQ  +  idn)  s  gaQ  +  pQ  ,  (B-l) 

where  an,  bn,  cn,  dn  are  independent,  zero-mean,  unit-variance,  real, 
Gaussian  random  variables.  Then,  for  g  real, 

E{xn(f)  Yn(f)}.  =  E|an  g(c«*  +  P*)}  =  gE{|an|2J  =  2g  ; 

E|jXn(f)j2}  =  2;  E  {|Yn(f)| 2}  =2(l  +  g2)  .  (B-2) 

Therefore,  the  MSC  is 

r-  <2S)2  _  g2 

2  x  2(1 +  g2)  1  +  g2  (B’3) 

For  a  specified  value  C  of  the  MSC,  the  required  value  of  scale  factor  in 
(B-l)  is 

-teT- 

Thus,  as  S-*l,  C  *  S2  *♦  1,  g -•  *>,  and  l/g-*0.  Because  we  are  interested  in 
S  near  unity,  we  can  concentrate  on  l/g  near  zero. 

If  we  define 

N  N  N 

n=l  n=l  n=l 

then  substitution  of  (B-l)  in  (3)  yields 


B-l 
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C  = 


N 


L  an(8<+^n) 

n=l 


d+kaI2 


£  N2  £  l*“»*  ftJ*  A(B+2gDr  +  g  A) 


N  N 

|an|2  _ 
n=l  n=l 

|D  |2  +  2gADr  +  ^A2 

AB  +  2gADr  +  g2A2 


(B-6) 


where  Dr  is  the  real  part  of  D  in  (B-5).  Rearranging  (B-6),  we  obtain 


C  =■ 


1  +—  T  +-4-U 

g  g2 

1  +— T  +-Vv 

g  g2 


(B-7) 


where 


2Dr  |D|2  £ 

T=—  u  ~~72~’  v  ~  a  ’ 

A 


(B-8) 


Now  a  series  expansion  of  (B-7)  in  powers  of  1/g  (as  noted  under  (B-4))  yields 


a„  a„  a,  a.  a,, 

r-i  ~  —  4  ,  J  6 

C_1+  2  3  4  5T  6 

g  g  g  g  g 


(B-9) 


where 


a2  =  U-V,  a3  =  -  (U  -  V)T,  a4  =  (U  -  V)(T2  -  V), 
a5  =  (U- V)(2V-T2)T,  a6  =  (U  -  V)(T4  +  V2  -3T2V) 


(B- 10) 


Since  we  are  interested  in  the  behavior  of  the  MC  estimate  S,  we  employ 
the  expansion 


(1  +  €)1/2  =  !  -  |-*2  +  ~«3  +  .  .  .  (1  «  I  <  1) 


(B-ll) 


B-2 
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to  obtain 


g-iU 


(H  a2a3\  1  4  4jj_ 

+  \2“'  4  /^+\2  8“  T  +  16/g6  + 


And,  since  we  are  interested  in  S  near  unity,  we  let 

x  =  1  -  C 


(B-12) 


(B-13) 


and  expand  S  in  a  power  series  in  x.  To  do  this,  we  utilize  (B-4)  and  obtain 

r  Vr^=*;l/2(i4,+!x2  +  ---) 


g 

l 


-=  X  +  x^  +  xJ  +  .  .  . 


g‘ 


-7=x3/2  i1  +lx  "Tx2  +  •  •  •) 

s 

— =  x2  (1  +  2x  +  3x2  +  ...)  =  x2  +  2x3  + 
g4 

Ts5/2(“f--) 


g 

it6 


=  x3  + 


«  «  • 


Substitution  of  (B-14)  in  (B-12)  yields 

§  =  1+x(|a2)  +x3/2(|a3)  +x2(la2+Ia4-|a2 
+  x5/2(|a3  +ia5  -i-a2a3) 

+  x3(j  a2  +  a4  “|a2  +  |a6  -ia3.  “7a2a4  +  jJa2j  + 


(B-14) 


.  (B-15) 


B-3 
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Now  we  are  ready  to  perform  averages  on  the  individual  terms  in  (B-15)  and 
obtain  an  expansion  of  E  j§}  in  powers  of  x  =  1  -  C. 

The  method  of  obtaining  E  will  be  developed  in  full.  The  results  for 
the  other  averages  in  (B-15)  will  merely  be  stated,  and  can  easily  be  deduced 
from  the  method  presented.  From  (B-10),  (B-8),  and  (B-5), 

2  N 

*2JU-V=---— £  PmPnQmn.  (B-16) 

hr  m,  n=l 

where  we  have  defined 

Qmn  =  am  an  “  A  8mn  •  (B-17) 

Now,  let 

Z  5  [a!  a2  •  *  •  ®n]  •  (B-18) 


Then,  since  Qmn  depends  only  on  or, 


E{u.v|a}.-i- 

N 

Qmn  E  )Pm  Pnf 

A  * 

m,  n=l 

Qnn  =  4 '  LA  -  ^ 
A2 


2  (N  -  1) 


(B-19) 


where  we  have  utilized  the  property 

E{PmPn}  =  26mn  *  (B-20) 

which  follows  directly  from  the  definitions  (B-l).  Therefore,  using  (B-19),  we 
have 

EjU-V}  =  -2(N-1)  Eji-J  .  (B-21) 

Now,  A  is  given  by  (B-5)  and  (B-l)  as 

N 

A  =  £  (^  +  5n)  •  (B-22) 

n=l 
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Therefore,  the  probability  density  function  of  A  is 


P(A)  = 


AN-1  exp(-A/2) 
2N  (N  -  1) ! 


A  >  0  . 


(B-23) 


There  follows  immediately  the  m-th  moment  of  l/A  as 


E|l/Am}  =  — 


m  (N  -  1)(N  -  2)  .  .  .  (N  -  m)  ’ 


m  <  N  .  (B-24) 


Employing  (B-24)  in  (B-21),  we  have 

E  ja2 }  =  E  ju  -  V }  =  -  1  . 


(B-25) 


By  employing  the  generalizations  of  (B-20)  to  the  fourth  and  sixth  orders, 
namely, 


EKPjPm^  =  4(6U6mn  +  61o1tilm)  • 

E  K  Pl'  Pm  pn  pp  pq  }  ‘  8  (6ld  6mp  6pq  *  6U  6mq  «np 

+  Pknp£mppq  +  ®kn  ®mq  +  ®kq  ®im*np  +  *kq  fyip^mn)  < 
we  find  the  following  quantities: 


M 

44  s 

N 

N-  2  ’ 

M 

=  o,  E|a2a3|  =  0,  E^}  - - 

N-l 

N-3  ’ 

/a2l 

2N  E  'a  a  *  - 

N2 

)a3( 

(N  -  2)(N  -  3)  ’  |  2  4f 

(N-  2) (N-3)  ’ 

H) 

N  (N  + 1) 

(N-2)(N-3)  * 

(B-26) 


(B-27) 


B-5 
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When  we  employ  (B-27)  in  (B-15),  there  follows 


E{S  |  =  1  “ilry  (1_C)2 


1  N2  -  7N  + 16 
16  (N-  2)(N  -  3) 


(1-C)3  +  .  .  . 


This  is  the  end  result  quoted  in  (23)  in  the  main  text. 


(B-28) 


< 


Appendix  C 

VARIANCE  APPROXIMATION  FOR 
MAGNITUDE  COHERENCE 

From  (19)  and  (40)  in  the  main  text,  we  have 

E|sf=Q0  +  Q1S2  +  Q2S4  +  .  .  .  , 

where 


Qo  “  gN 


Ql  -  Gn 


N(N-l) 
2N  +  1 


N  (N  -  1)  (6  +  N  -  N2) 
^2  N  4(2N  +  l)(2N  +  3) 


And,  from  (18),  we  have 


E{s2|  =  R0  +  Rx  S2  +  R2  S4  +  .  .  .  , 


where 


Therefore, 


Ro  = 


_i 

N 


N-  1 
N  +  l 


Ro  = 


N-  1 


2  ~  (N  +  1)  (N  +  2) 


Variance  (S)  =  a  +  pS2  +  VS4  +  .  .  .  , 


where 

«  =  R0  -  <4 

P=R1-2Q0Q1 
Y  =  R2  -  Q2  -  2QqQ2  . 
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(C-l) 


(C — 2 ) 


(C-3) 


(C-4) 


(C -5) 


(C-6) 


C-l 
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By  use  of  (40)  and  reference  7,  equation  6. 1. 47,  we  find 

_  _  n/7/2  f  1  1  1 

N  ~  nW  [  SN  *  128N2  +  •  •  -J  • 

Expanding  the  above  expressions  in  powers  of  N”*,  we  find 

/  «\  1  n  1  65*  1 

“  "  V  ”  4/  N  '  16  $2  *  16384  ^3  +  ‘  ‘  ‘ 

8  =  1-—-  (2  -  — )  —  + 

P  4  V  16/ N 


„  #  7 w 

V""32N  +  l28+‘‘*  - 


(C-7) 


(C-8) 


Thus,  (C-5)  and  (C-8)  give  a  power  series  expansion  of  Variance  (§)  that  should 
be  accurate  for  large  N. 

The  variance  approximation  that  we  adopt  is  given  in  (33).  We  expand  (33) 
in  powers  of  S2  and  obtain 

•w  ■  izhi;  [J1 4  *  A|  - s2|f  -  AlB*2>  - 2 11 4 '  A)j 

-  4  Ja((B-1i2-D)  -  2(|-  -  A(B  +  2>)  +  (l  +  a)|  +  .  .  .  .  (C-9) 

We  now  select  constants  A,  B,  and  D  so  that  (C-5)  and  (C-9)  match  up  through 
the  power  4 .  There  follows 


A  2  (N  -  2)  a  -  1  — 


-  2  (N  -  2|(2a  v  P)J  - 


D  -  (B  *  li2  -  -J-  2  (N  -  2)  (3a  -  2(3  -  V  > 
A 


tC  - 10) 


C  -  2 
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We  now  employ  the  expansions  for  a,  |3,Y  in  (C-8)  and  obtain,  finally, 

A  =  -0.  57080  +  1. 7489/N  +  0.  76047/N2  +  .  .  . 

B  =  0.  75194N  -  3.  2639  +  .  .  . 

D  =  0.  22142N2  -  1.  6648N  +  .  .  .  (C-ll) 

Equations  (33)  and  (C-ll)  are  the  final  results  for  the  variance  approximation. 

It  has  been  found  sufficient  to  retain  only  three  decimals  in  the  constants  and  to 
stop  with  the  terms  shown  in  (C-ll). 
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Bad  Data  Points,  Via 
Maximum  Entropy  and 
Linear  Predictive 
Techniques 

Albert  H.  Nuttall 


ABSTRACT 

A  comparison  of  several  methods  for  spectral  esti¬ 
mation  of  a  univariate  process  with  equi -spaced 
samples,  including  maximum  entropy,  linear  predictive, 
and  autoregressive  techniques,  is  made.  The  com¬ 
parison  is  conducted  via  simulation  for  situations 
both  with  and  without  bad  (or  missing)  data  points. 

The  case  of  bad  data  points  required  extensions  of 
existing  techniques  in  the  literature  and  is  docu¬ 
mented  fully  here  in  the  form  of  processing  equations 
and  FORTRAN  programs.  It  is  concluded  that  the  maximum 
entropy  (Burg)  technique  is  as  good  as  any  of  the 
methods  considered,  for  the  univariate  case.  The 
methods  considered  are  particularly  advantageous 
for  short  data  segments. 

This  report  also  reviews  several  available  techniques 
for  spectral  analysis  under  different  states  of  knowl¬ 
edge  and  presents  the  interrelationships  of  the  various 
approaches  in  a  consistent  notation.  Hopefully,  this 
non-rigorous  presentation  will  clarify  this  method  of 
spectral  analysis  for  readers  who  are  nonexpert  in  the 
field. 
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SPECTRAL  ANALYSIS  OF  A  UNIVARIATE  PROCESS  WITH 
BAD  DATA  POINTS,  VIA  MAXIMUM  ENTROPY  AND 
LINEAR  PREDICTIVE  TECHNIQUES 


1.  INTRODUCTION 

The  analysis  of  power  density  spectra  of  random  processes  via  maximum 
entropy,  linear  predictive,  and  autoregressive  techniques  has  attracted  much 
attention  recently,  especially  for  short  data  segments.  In  particular,  a  good 
review  article  (reference  1)  recently  appeared  in  which  115  references  are 
listed  on  the  topic  of  linear  prediction.  Another  good  paper  on  this  method  of 
spectral  analysis  (including  a  comparison  of  techniques)  is  available  in  refer¬ 
ence  2,  where  66  references  are  cited.  Additional  related  references,  that  this 
author  is  aware  of,  are  given  in  references  3  through  15  of  this  report.  The 
close  links  that  exist  between  maximum  entropy  spectral  analysis  (MESA) , 
autoregressive  spectral  analysis,  predictive  error  filters,  noise-whitening  fil¬ 
ters,  and  least-squares  model  building  are  pointed  out  very  well  in  reference 
14. 


The  purposes  of  this  report  are  to  review  and  interrelate  several  available 
techniques  for  spectral  analysis  under  different  states  of  knowledge,  for  equi- 
spaced  samples,  in  a  consistent  notation;  collect  and  compare  the  techniques 
via  simulation  in  order  to  determine  the  best  available  technique(s);  and  extend 
the  best  technique(s)  to  handle  the  case  of  bad  (or  missing)  data  points  and  com¬ 
pare  them  via  simulation.  The  only  detailed  comparison  of  techniques  for  no 
missing  data  points  available  thus  far  in  the  literature  is  that  in  reference  2, 
where  the  Burg  technique  and  the  Yule-Walker  approach  are  compared.  Here 
we  will  extend  the  comparison  to  include  the  Burg  technique,  the  Yule-Walker 
approach,  an  unbiased  version  of  the  Yule-Walker  approach,  the  approximate 
maximum  likelihood  and  least-squares  approaches  of  reference  16,  the  auto¬ 
correlation  and  covariance  approaches  of  reference  1,  and  an  extended  version 
of  the  covariance  approach.  (A  comparison  with  the  maximum  likelihood  tech¬ 
nique  is  reserved  for  a  future  report.)  Also,  we  will  compare  the  best  of  these 
approaches  for  the  case  of  bad  (or  missing)  data  points  and  present  FORTRAN 
programs  for  the  recommended  techniques. 

Throughout  this  report,  we  assume  we  are  dealing  with  equispaced  samples 
of  a  stationary  zero-mean  random  process  x(t);  that  is,  xn  =  x(nA),  where  A  is  the 
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sampling  interval  in  time.  In  section  2,  we  will  assume  that  the  correlation 
function  of  the  sampled  process,  jxn}  ,  namely,  * 

R.  =x  x*  =  R*  ,  (1) 

K  n  n-k  -k 

is  known  exactly  for  all  k,  and  shall  present  two  alternative  equations  to  deter¬ 
mine  the  spectrum  of  |xn[  ;  the  latter  of  the  two  equations  serves  as  a  guide  to 
the  MESA,  linear  predictive,  and  autoregressive  approaches.  In  section  3,  it 
will  be  assumed  that  R^  is  known  only  for  a  limited  range  of  values  of  k,  and 
three  alternative  approaches  will  be  considered  and  shown  to  lead  to  identically 
the  same  spectral  approximation.  Next,  in  sections  4  and  5,  the  practical 
problem  of  an  unknown  correlation  function  and  only  a  finite  data  set  of  |xn|  , 
n  =  1,  2,  ...,  N,  some  of  which  may  be  bad,  will  be  addressed,  and  several 
candidate  techniques  for  spectral  estimation  will  be  presented.  Finally,  a  com¬ 
parison  of  the  techniques,  via  simulation,  will  be  conducted  and  conclusions 
drawn  regarding  the  best  available  technique,  both  with  and  without  bad  data 
points.  FORTRAN  programs  for  the  best  technique  for  both  situations  will  also 
be  presented. 


*The  case  of  complex  samples  is  treated,  so  that  we  can  handle  complex 
envelope  or  complex  demodulated  processes.  Specialization  to  real  processes 
is  immediate,  and  (1)  becomes  =  R-k-  An  overbar  indicates  an  ensemble 
average. 
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2.  CORRELATION  KNOWN  EXACTLY  FOR  ALL  ARGUMENT  VALUES 

Suppose  the  correlation  function  in  (1)  of  process  |xnj  is  known  for  all  k. 
The  standard  (double-sided)  definition  of  the  spectrum  of  |xn|  is  then  (see,  for 
example,  reference  14,  equation  (10) ) 


1 

Gx(f)  =  A  £  Rk  exp(-i2rfkA),  ifi  <  — .  (2) 

k=-CD 

Gx(f)  is  real  and  nonnegative,  but  need  not  be  even  in  frequency  f  for  complex 

lRkl- 

2.  1  LINEAR  PREDICTION  BASED  ON- 
INFINITE  PAST 

Suppose  that  sample  values  x^.^,  xj^,  . . .  are  available  and  are  used  to 
linearly  predict  the  value  of  x^.  Then  the  one -step  predicted  value,  based  on  the 
infinite  past,  is  (for  a  zero-mean  process) 


A 

X. 


CD 


*  £ 


n=  1 


a  x 
n  k- 


(3) 


The  values  of  the  complex  predictive  filter  coefficients  ja^}  i  are  chosen  such 
that  the  one-step  prediction  error 


c 


a  x 
n 


k-n 


(4) 


has  minimum  ensemble  average  magnitude-squared  value.  Figure  1  depicts  the 
interrelationships . 

The  ensemble  average  magnitude- squared  error  is,  employing  (1),  given  by 

- ?  ® 

E  =  It.  I  =  L  a  a*  R  .  (5) 

Ik  ^.mnn-m 

m,  n=  0 

For  a  minimum,  we  first  compute  (see  reference  17,  appendix  A) 
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WHITENING  FILTER  (c>0<  a,,  a^.*) 

Figure  1.  Block  Diagram  of  Predictive  and  Whitening  Operations 


=  y  a  j2  >  1 

*  ~  n  I- m  m  ’ 
daf  m  =  0 


and  set  it  equal  to  zero,  obtaining  the  optimum  predictive  filter  coefficients 
as  the  solution  of  the  set  °*  equations" 

£  R  a  =  0,  1  >  1  (a  =  a  =  -  1)  . 

n  f-  m  m  o  o 

m=  0 

The  minimum-error  sequence  then  possesses  correlation 


E.  =  7,  TJ  .=  T  a  a*  x  x*  . 
j  k  k  -  j  L~‘_  n  mn  k-m  k-j-n 


m,  n=  0 


T  a  a*  R.  =  L  a*  Y  R. 

.  mn  j+n-m  ^  n  *—  j+n- 
m,  n=  0  n=  0  m=0 


a  , 
m  m 


•The  same  result,  (7),  can  be  obtained  by  setting  the  partial  derivatives  of 
E,>  with  respect  to  the  real  and  imaginary  parts  of  af,  equal  to  zero. 
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where  we  have  employed  (4)  and  (1).  Now  the  innermost  sum  on  m  in  (8)  is  0 
for  j  +  n  >1,  by  (7).  And  if  j  >1  in  (8),  then  j  +  n  >  1  since  n  >  0  in  the  outer¬ 
most  sum  in  (8).  Therefore,  Ej  =  0  for  j  >1.  Also  since  E_j  =  E*,  we  have 

E.  =  0  for  j  /  0  ;  (9) 

that  is,  the  minimum-error  sequence  is  uncorrelated  and  therefore  pos¬ 

sesses  a  white  spectrum.  The  linear  filter  characterized  by  coefficients 
i  J  q'  is  a  whitening  filter;  see  figure  1. 

The  correlation  of  {7^}  for  zero  time  delay  is  the  power  of  the  minimum 
error  and  is  given  by 


E 

o 


L 


n=  0 


If* 

n 


E 


m  =  0 


R  a 
n  -  m  m 


U) 

=  a*  y  R  a 
o  -m  m 

m=  0 


R 

o 


£ 


m  =  1 


R*  a 
m  m 


(10) 


where  we  have  used  (8),  (7),  and  (1).  The  spectrum  of  |7^|  is  therefore 
(using  (9) ) 


G-(f)  =A  y  E.  exp(-i2  »fjA)  =  aEo>  ifi  <  £,  (11) 

j=—  J 

which  is  white,  as  mentioned  above. 

But  since  the  error  sequence  is  given  by  a  linear  transformation  of  process 
|  according  to  (4)  and  figure  1,  the  spectrum  of  {7^}  is  given  by  the  stand¬ 
ard  linear  filter  relation 


G -(f)  =  |A(f)|2Gx(f)  , 

where 


(12) 


U» 

A(f)  =  y  a  exp(-i2  ffnA) 
n=  0 


(13) 


o 
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is  the  transfer  function  of  the  whitening  filter  and  is  assumed  to  be  stable.* 
Combining  (11)-(13),  we  obtain  an  alternative  expression  to  (2)  for  the  spectrum 
of  as 


Gx®“ 


AE 

o 


£  a  exp(- i2?rfnA) 
n=  0 


(14) 


Given  the  correlation  values  jR^}  .  utilization  of  (14)  requires  solution  of  the 
set  of  equations  in  (7)  for  the  filter  coefficients  and  subsequent  substitu¬ 

tion  in  (10)  and  (14).  Although  this  is  not  a  practical  alternative  to  (2)  in  this 
case,  it  does  serve  to  indicate  that  there  is  possibly  some  potential  in  the  idea 
of  determining  predictive  filter  coefficients  to  minimize  the  average  magnitude- 
squared  one- step  prediction  error  and  thereby  obtain  a  white  spectrum;  this 
idea  will  prove  to  be  quite  fruitful  later  on. 

As  an  aside,  if  we  allow  a_^  4  0  in  (3)  and  minimize  |«k}“>  we  find  4  0, 
although  Ej  =  0  for  j  >2.  Thus,  the  minimum-error  sequence  would  not  be 
white,  and  a  convenient  expression  like  (14)  would  not  result. 


It  should  also  be  noted  that  the  crosscorrelations  between  the  minimum- 
error  sequence  j«k|  and  all  past  values  of  the  input,  jx^}  ,  are  zero;  this 
follows  by  use  of  (4),  (1),  and  (7). 


2.2  LINEAR  PREDICTION  BASED  ON 
INFINITE  FUTURE 


If  sample  values  Xk+i»  xk+>’  •••  3X6  available  and  are  used  to  linearly 
'■predict"  the  value  of  x^  according  to  a  backward  regression  (that  is, 
combine  future  values), 


a* 

n 


k  +  n  ’ 


(15) 


•That  is, 
plex  z-plane. 


V 

n=  0 


—  -n 
a  z 
n 


has  all  its  poles  inside  the  unit  circle, 


O. 


in  the  com- 


6 


TR  5303 


then  the  one-step  error 


=  x,  -  x 


CD 

*\?o 


a’x, 
n  k  +  n 


(ao=_1) 


has  average  magnitude- squared  value 


L 

m,  n=  0 


a  a*  R  , 
m  n  n  -  m 


(16) 


(17) 


which  is  identical  to  (5).  Thus,  the  same  optimum  filter  coefficients  in  (7)  that 
minimized  (5)  would  also  minimize  (17).  The  minimum-error  sequence  in  (16) 
would  also  be  white,  and  an  expression  for  the  spectrum  of  jx^}  identical  to 
(14)  would  result.  The  point  of  this  result  is  that  an  equivalent  expression  for 
the  spectrum  of  jx^j  is  obtained  by  the  backward  regression  (15),  rather  than 
the  forward  regression  (3)  of  the  preceding  subsection.  This  idea  will  prove 
useful  later  when  we  have  to  deal  with  finite  data  sets  and  unknown  correlation 
functions. 


The  crosscorrelations  between  the  minimum-error  sequence  and  all  future 
values  of  the  input  are  zero;  this  follows  by  use  of  (16),  (1),  and  (7). 

2.3  LINEAR  INTERPOLATION  BASED 
ON  INFINITE  PAST  AND  FUTURE 


If  we  attempt  to  combine  the  approaches  of  the  previous  two  subsections, 
we  are  led  into  considering  linear  interpolation  according  to 


The  error 


Ul> 


n  = 


n^O 


£ 


n  =  -<d 


a  x 
n 


k-  n 


(16) 


(19) 
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has  average  magnitude-squared  value 


£ 


L 

m,  n=  -ao 


a  a* 
m  n 


R 

n 


m 


(20) 


using  (1).  Setting  <JE/da*  =  0  for  4  /  0,  we  obtain  for  the  optimum  filter  coeffi¬ 
cients 


£  R,-m 


a 

m 


0,4/0 


fa  =  a  =  -  1)  . 
o  o 


(21) 


There  follows,  by  use  of  (1), 


(22i 


The  correlation  of  the  minimum-error  sequence  J*"^}  is  now 


E 

J 


* 

k 


I 

m,  n=  -<u 


a  If*  R 
m  n  j  n  -  m 


R 

j  -  n  -  m 


L 


m  =  -■«> 


a  R 
m  -  m 


a  E  ,  (2 3 1 
J  u 


where  we  have  employed  (19),  (1),  (21),  and  (22).  It  is  generally  nonzero  lor 
j  /  0.  The  spectrum  of  the  minimum-error  sequence  is  therefore 


G;(f)  =  -  .IE 

*  o 


exp  ( -  i2 


-iE  A,l), 
o 


(24> 


where  we  have  used  (23  >  and  assumed  A  if)  to  be  stable.  This  spectrum  is  not 
white,  in  fact,  employing  (12),  (24)  can  be  expressed  as 


G  ”(f) 


A2 

o 


G  (f)  ’ 
x 


which  is  the  inverse  o!  the  input  spectrum. 


(25) 
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If  we  instead  eliminate  G7(f)  from  (12)  and  (24),  we  obtain  an  expression 
for  the  input  spectrum  in  terms  of  filter  A(f)  in  (24)  as 


oxm  =  - 


ae 

_ _ 0_ 

A*(f) 


dE  i 

— 2 

A (f )  *  2a’ 


(26) 


the  realness  of  A(f)  follows  from  (22). 


There  is  an  uncorrelated  property  between  the  minimum  error  and  the  input 
in  the  present  case  also.  Namely,  the  crosscorrelation  between  the  minimum- 
error  sequence  and  the  input  is 


C 

J 


*-  \*i-> 


y*  a  x,  X 
n  k-  n 

n=  -<n 


R 

J  -  n 


=  E  4 
o  oj 


(27) 


using  (19),  (1),  (21),  and  (23).  Thus,  the  minimum-error  sequence  is  uncorre¬ 
lated  with  all  past  and  future  values  of  the  input  except  at  the  same  time  instant. 
The  cross- spectrum  is 


G.  (fi  ;  l  C  exp(- i2*fj  A)  =  AE  , 

« x  j  o 


2A  ’ 


(28) 


which  is  white. 

Although  (26)  and  (21)  offer  an  alternative  to  (14)  and  (7)  in  the  present  case 
of  known  correlation  function  {Rg(  ,  it  suffers  in  the  practical  case  of  unknown 
correlation  and  a  finite  data  set,  by  virtue  of  the  estimate  of  the  real  denomina¬ 
tor  of  (26 »  going  through  zero  (or  being  complex  if  (22)  is  ignoredi  at  some 
values  of  f.  This  is  not  a  significant  problem  for  (14)  since  both  the  real  and 
imaginary  parts  of  the  estimate  of  (13)  must  simultaneously  equal  zero  there, 
in  order  to  constitute  a  problem. 

Another  important  practical  drawback  of  this  interpolation  approach  is  that 
ensemble  average  |«g|2  would  probably  be  approximated  by  £  w^ere 

the  sum  Is  conducted  over  those  values  of  k  at  which  a  meaningful  value  of 
error  can  be  formed  for  a  segment  of  a  single  member  function  of  an  en¬ 
semble.  But  since  the  minimum-error  sequence  is  not  uncorrelated  m 

this  case  (see  (23)  i,  minimization  of  £  '*k*~  f°r  a  sir>gle  member  function 

.  .  O 

segment  is  not  synonymous  with  minimization  of  ,  rather,  the  nunimiza- 
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tion  of  ~  i*kl^  spuriously  involve  correlation  between  adjacent  terms 

which  are  not  included  in  j and  which  will  bias  the  filter  coefficients. 
Several  simulation  runs  (on  real  data)  confirmed  this  conclusion  by  yielding 
severely  biased  (and  negative)  estimates  of  spectrum  Gx(f),  even  when  (22)  was 
taken  into  account.  Accordingly,  the  interpolation  approach  was  dropped  from 
further  consideration. 
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3.  CORRELATION  KNOWN  EXACTLY  FOR  A  LIMITED 
RANGE  OF  ARGUMENT  VALUES 

In  this  section,  R^  of  (1)  is  assumed  to  be  known  exactly  for  k.  <  p  and 
unknown  for  ikl  >  p.  Since  we  are  unable  to  compute  the  exact  spectrum  Gx(f), 
given  by  (2),  in  this  case,  a  different  approach  involving  approximation  to  Gx(fi 
is  required.  Three  different  techniques  will  be  considered  and  shown  to  yield 
identically  the  same  approximation  to  Gx(f). 

3.1  MAXIMUM  ENTROPY  SPECTRAL 
ANALYSIS  (MESA) 

The  method  in  this  subsection  was  originally  given  in  reference  Is  and 
elaborated  upon  in  reference  19.  We  begin  with  (2)  and  note  that 


_1_ 

21 


j"  df  Gx (f >  exp(i2rfkJi)  =  df  G^ff)  expii2-fkAi  = 

1  1/A 


_1 

21 


(29i 


We  wish  to  approximate  G  (f)  by  a  real  nonnegative  function  G (f >  such  tnat  its 
entropy  (reference  Is,  equation  (1)  ) 


1 


dfjln  G(f) 


(30i 


is  maximized,  subject  to  the  integral  constraints 


/ 

1/A 


df  G (f )  exp(i2^-fk^l  =  R^,  k  <  p. 


(31) 


To  this  aim,  we  form  the  quantity 


Q  =  I  df  Jn  G(f)  -  ^  «  f  df  G(f)  exp(i2*fkJkj 


(32) 
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where  Lagrange  multipliers  because  of  the  restriction  R_k  =  r£  ,  as 

shown  in  (1).  We  perform  a  variation  of  (32)  according  to 

P  , 

-  £  df  jGQ(f)  +  en  (f)j  exp(i2fffkA)  , 

k='P  l/l  (33) 

where  GQ(f)  is  the  "optimum"  approximation  to  Gx(f)  under  criterion  (30),  and 
obtain,  upon  setting 


Q+JQ  = 


j  df  J2n  ^GQ(f)  +  «n  (f)J 

1/i 


cHQ  +  6Q) 

dt 


0  at 


=  0, 


the  relation 


Gom 


1 


£  exp(i2;rfk.i) 
k=-p 


(34) 


(35) 


G^ (f )  is  real  since 


Since  it  is  also  to  be  nonnegative,  we  can  express 


where 


G  (f) 
o 


P 

?<f)  =  £  *>k  exp(i2fffk^»,  f  -  — , 

k  =  0 


(36) 


(37) 


and  where  ><fi  has  no  zeros  in  the  upper-half  complex  f-plane,  that  is,  poly¬ 
nomial 


P 

B(z>  =  £  ak  z 
k  =  0 


(3Si 
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has  no  zeros  inside  the  unit  circle,  O,  in  the  complex  z -plane.  A  proof  that 
B(z)  in  (38)  has  no  zeros  inside  O  is  given  in  reference  11,  page  7,  for  exam¬ 
ple.*  Specifically,  it  is  shown  that  B(l/z)  has  all  its  poles  and  zeros  inside  O; 
that  is,  B(l/z)  is  minimum  phase. 

In  order  to  determine  the  constants  in  (37),  we  express  (36)  as 


lfl  <  2i  ■ 


(39) 


(We  could  equally  well  have  multiplied  by  y(f)  in  the  following.)  Therefore,  for 
all  values  of  Jl , 


df  Go(f)>*(f)  exp(i2ff£iA) 


df 


1 

7(f) 


exp(i2fffj?A)  . 


But  using  (37),  this  can  be  expressed  as 


(40) 


]T  o*  f  df  GQ(f)  exp(i2rf(i-k)A)  = 
k=  o 


df  exp(i2?rfjg^) 

P 

^2  aj.  exp(i2rfkA) 
k=  0 


all  (41) 


Now  if  j2  is  an  integer  in  the  range  (0,  p]  ,  the  integral  on  the  left  side  of  (41)  is 
equal  to  R; (via  (29)  )  for  any  value  of  k  in  its  range  [0,  p]  ;  this  is  where  the 
constraints  are  employed.  Therefore,  we  have  for  integer  JL, 

P 

E  R,  k  al =  b/  ’ 0  - 2  - p>  (42) 

k=0  '  K  k 

where 


exp(i2??fi^) 

_ 

£  <* ^  exp(i2nfk^) 
k=  0 


0  <  i  <  p  . 


(43) 


‘The  proof  is  couched  m  terms  of  the  recursive  solution  of  (46i  presented 
in  append  lx  A. 
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In  (43),  letting  z  =  exp(i2/rfA)  and  using  (38),  we  have 


b,  =  -L  ^  il  ,  o 

*  i2jrd  J'  z  B(z) 


<  i  <  p  , 


where  #  denotes  counterclockwise  integration  around  the  unit  circle  O  in  the 
complex  z-plane.  Now  B(z)  has  no  zeros  inside  O  by  construction.  Further¬ 
more,  B(z)  can  have  no  zeros  on  O,  for  then  y(f)  would  be  zero  for  some  f,  and 
GQ(f)  would  possess  infinite  power,  contradicting  R0<  <».  Then  (44)  yields 


b,  =  7—  ‘to  ■  0  s  1  s  p  ■ 
&an 


and  (42)  becomes 


1^  —  A  u  a  n 


This  is  p-^1  linear  equations  in  p+1  unknowns .  * 
Now  let  correlation  matrix  R  be  defined  as 


R  R  , 
o  -1 


R,  R 
1  o 


and  define  two  column  matrices 


‘  -  U  0  0  ...  0] T  ,  «  =  [a  a  ...  a  1T 

o  1  p 1 


i  itiun  ot  ( -4 1 ; »  us  presented  in  appendix  A. 
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R  is  Hermitian,  Toeplitz,  and  nonnegative  definite.  Then  (46)  can  be  expressed 
as 


R  a  *  = 


Aa, 


(49) 


with  solution 


i  *  — 


_L_  R-1  s 


A  a. 


(50) 


Now  let  the  inverse  matrix 


c  c  ...  c 
00  ol  op 


c  c 
lo  11 


po 


pp 


(51) 


Then  (50)  and  (48)  yield 


I 

a  -  -  C  ,  a 

0  Aan  00  O' 


1/2 

1  fC°°^ 

=  T  c  ,  a  =  l - ) 

A  OO  O  \  A  / 


exp(ie) , 


(52) 


where  0  is  an  arbitrary  real  constant.  (c00  is  always  real.)  Utilizing  this  re¬ 
sult  and  (15)  in  (50),  there  follows 


'ko 


exp(-  i  e) ,  0  <  k<  p  , 


(53) 


oo 
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and  (37)  becomes 


]T  c*  exp (i2fffkA),  If!  <  j- 


/Ac  k=  0 

'  oo 


Finally,  using  (36),  the  "optimum"  spectrum  (called  the  maximum  entropy 
spectrum)  is 


G  (f)  =  - - 

o'  P 


• if!  <  fr 


^2  c  exp(- i2rrfkA) 


Equation  (55)  gives  the  maximum  entropy  spectrum  in  terms  of  the  first 
column  of  the  inverse  of  the  correlation  matrix  R  of  available  known  correlation 
values;  see  (47).  The  forms  of  (55)  and  (46)  are  similar  to  those  encountered 
earlier  in  (14)  and  (7),  respectively;  see  also  appendix  A.  The  maximum  value 
of  the  entropy  defined  in  (30)  is  evaluated  in  appendix  B  and  is  given  by  in  (a/cqo). 


Substitution  of  (53)  in  (38)  yields 


B(z)  =  SiEfcii!  £  c  (56) 

VJcT"  k=0 

^  00 

p 

Thus,  investigation  of  the  zeros  of  B(z)  depends  on  the  polynomial  ~  ck  z^ ; 

k=  0 

it  must  have  no  zeros  inside  the  unit  circle  0.  But  if  we  combine  (46)  and  (53), 
we  can  write  that 


£  R,.kc - V 

k=  0  oo 


Now  reference  1,  page  567,  declares  that  all  the  zeros  of  z  Ci._  z~k  must  lie 

k  =  0  K0 

inside  O  since  R  is  a  correlation  matrix.  Therefore,  polynomial  B(zi  has  no 
zeros  inside  O. 
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3.2  LINEAR  PREDICTIVE  FILTERING 


Here,  as  in  the  previous  subsection,  the  available  information  is  knowledge 
of  R^  for  Ik!  <  p.  A  linear  one-step  prediction  of  x^,  based  on  the  past  p  values, 
xj^,  ....  xj^p,  is  to  be  accomplished  with  minimum  average  magnitude- 
squared  error;  see  figure  1.  Now,  however,  instead  of  (3),  we  have  for  the 
predicted  value  the  finite  sum* 


z 


n=  1 


a  x 
n 


k-n  ’ 


The  instantaneous  error  is 


k 


P 


E 

n=  0 


a  x. 
n  k-n 


(58) 


(59) 


(Equations  (58)  and  (59)  constitute  stable  digital  filters  regardless  of  the  choice 
of  coefficients.)  The  ensemble  average  magnitude-squared  error  is 


E  = 


P 

z 


m,n=  0 


a  a* 
m  n 


R 

n-m 


a 


where  we  have  used  (1) 


and  (47)  and  defined 


a 


la 


(60) 


(61) 


We  now  wish  to  minimize  E  by  choice  of  filter  coefficients  jan[  .  How¬ 
ever,  we  have  the  constraint  on  a0  in  (59);  this  can  be  expressed  mathematically 
as 


•  H  »  =  -  1 ,  (62) 

where  i  is  defined  in  (48).  In  order  to  minimize  (60)  subject  to  (62),  we  form 
the  quantity 


*  p 

The  more  general  form  including  V  b^  x^_n  is  not  considered  here. 
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H  _  H  *  T  * 

a  R  a  -  X  a  J  -  X  a  S 

=  (a  -  XR-1«)H  R  ( a  -  X  R _1  J)  -  IXI2JHR-1*,  (63) 

where  R-l  is  defined  in  (51).  Since  R  is  nonnegative  definite,  being  a  correla¬ 
tion  matrix,  (63)  is  minimized  by  the  choice  of  coefficients 

a  =  X  R_1  *.  (64) 

The  Lagrange  multiplier  X  is  obtained  by  substituting  (64)  in  constraint  (62),  and 
using  (51)  and  (48): 

a*1Hr_13  =  -  1  ,  X  =  -  — .  (65) 

c 

oo 

Then  (64)  yields 

Cko 

a^  =  -  - ,  0  <  k  <  p  .  (66) 

^  oo 

The  minimum  value  of  the  error  power  is  found  by  utilizing  (64)  and  (65)  in 

(60): 

9  p  li  i  i  9  i 

E  =  1 7  1“  =  X'  •  R~  RR~  $  =  X  “c  =  -L_,  (67) 

o  I  k  I  oo  c 

oo 

where  is  the  minimum-error  sequence  obtained  by  employing  (66)  in  (59). 

(A  recursion  relation  for  E^  is  presented  in  (A— 7) ;  it  can  be  started  with 
l/c^  =  rq  .)  Notice  from  (67)  that  cqo  must  be  positive,  for  non- negative 

definite  R  . 

The  transfer  function  of  the  optimum  error  filter  from  input  x  to  output  7 
in  figure  1  is,  from  (59)  and  (6G), 
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A(f)  =  £  exp(-i2?rfkA) 
k=  0 


=  -  -  Y\  c  exp  (-  i2»fkA),  t  f  1  <  — 

c  *—*  ko  2A 

oo  k=  0 


Furthermore,  the  spectra  in  figure  1  are  related  by 


S  2 


G-(f)  =  1  A(f)|  Gx(f)  . 


(68) 


(69) 


Now  let  us  assume  that  the  spectrum  of  the  minimum-error  sequence  is 
white  over  the  band  (-“,  r— );  this  is  in  line  with  the  property  (11)  which  held 

Li 

for  the  case  when  the  infinite  past  was  available.  Then  we  say 


Ifl  < 


1 

2A  * 


(70) 


where  we  have  used  (67).  Substitution  of  (68)  and  (70)  in  (69)  yields  the  linear 
predictive  spectrum  approximation  to  the  input  spectrum  according  to  the  defi¬ 
nition 


Gx(f)  , 


Gr(f) 

i  a  (f)i 2 


AC 

_ oo _ 

P 

£  C^o  exp(-  i2  rr  fkA) 
k=  0  ° 


2 


(71) 


This  is  identical  to  the  approximation  (55)  obtained  by  MESA.  It  is  critically 
dependent  on  the  assumption  that  the  spectrum  of  the  minimum-error  T  in  fig¬ 
ure  1  is  white. 

Since  (71)  is  identical  with  the  maximum  entropy  spectrum,  (55),  it  must 
follow  that 


dfG.(f)  exp(i2*fkA)  =  R  for  Iki  <p; 

X  K 


(72) 
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that  is,  although  not  specified  in  the  current  approach,  the  correlation  function 
formed  from  the  linear  predictive  spectrum  Qx(f)  in  (71)  has  the  same  values  at 
ka  for  Ikl  <p  as  the  known  correlation  values  {Rki  • 


The  implications  of  the  assumption  (70)  of  a  white  spectrum  for  the  mini¬ 
mum  error  are  investigated  in  appendix  C.  It  is  shown  that  the  crosscorrela¬ 
tion  function  between  input  x  and  output  7  of  figure  1, 


C 


/ 


(73) 


must  then  satisfy 


/  1/c  ,  j2  =  0 

(  oo 


(74) 


that  is,  minimum-error  sequence  {7k|  is  assumed  uncorrelated  with  all  the 
past  values  of  the  input.  It  is  also  shown  that  the  unknown  correlation  values 
for  k  >  p  can  be  approximated  according  to 


Pc  P 

=  -  E  —  K  =  E  *  Rt  ,  k  >  p  + 1 . 

k  *-*  c  k-n  n  k-n 

n  =  1  oo  n  =  1 


(75) 


This  recursion  relation,  starting  with  known  values  Rj_,  . . . ,  Rp,  can  be  con¬ 
sidered  to  be  an  extrapolation  of  the  known  correlation  values  into  regions 
where  they  are  unknown.  Equation  (75)  is  shown  in  appendix  D  to  be  a  stable 
recursion  when  B(z)  of  (56)  has  no  zeros  inside  O;  this  property  has  been  dis¬ 
cussed  under  (38),  (56),  and  (A-9).  It  can  also  be  shown  that  Fourier  transfor¬ 
mation  of  the  extrapolated  correlation  approximants  yields  precisely  (71).  It  is 
interesting  to  note  that  (75)  has  the  same  form  as  the  predictive  equation  (58)  for 
individual  data  values. 


Since 


df  G^ff)  exp(i2wfkA) 


(76) 


is  the  autocorrelation  at  delay  ka,  it  is  given  by  (72)  for  k  <  p,  and  by  (75)  for 
k  >p  +  1,  where  the  latter  correlations  are  extrapolated  values.  This  follows 
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A  A 

from  setting  G~(f)  white  and  choosing  Gx(f)  by  (71),  according  to  the  analysis  in 
appendixes  C  and  D. 

If  sample  values  x^+i,  . . x^+p  were  used  to  linearly  "predict"  x^  ac¬ 
cording  to  backward  regression 


c  =  Y*  a*  x  , 
k  n  k+n 

n=  1 


(77A) 


the  one-step  error  «k  =  xk  ~  xk  ^as  average  magnitude- squared  value 


E  =  k 


P 

E 

m,n=  0 


a  a  R 
m  n  n-m 


(ao=-i), 


<77B) 


which  is  identical  to  (60).  Thus,  the  same  optimum  filter  coefficients  in  (66) 
that  minimized  (60)  would  also  minimize  (77B),  and  an  approach  similar  to  that 
above  would  yield  a  spectral  approximation  identical  to  (71).  The  equivalence 
of  the  results  of  this  backward  regression  to  that  of  the  forward  regression  in 
(58)  will  prove  useful  later  when  we  deal  with  finite  data  sets  and  unknown 
correlation  functions. 


3.3  ALL-POLE  DIGITAL  FILTER 
MODEL 

The  available  information  about  process  jx^j  is  the  same  as  in  the  previ¬ 
ous  two  subsections,  namely,  knowledge  of  R^  for  ikl<  p.  Consider  a  sampled 
autoregressive  process  jy^}  in  steady  state  generated  by  a  stable  all-pole 
digital  filter,  H(z),  excited  by  discrete  white  noise  jw^}  ;  see  figure  2.  The 
noise  is  characterized  by  correlation 


w,  w*  =6  ,  all  n,  (78A) 

k  k-n  no 

with  no  loss  of  generality,  and  has  spectrum 

Gw(f)  =  A  ,  Ifl  <  .  (78B) 
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w 


k 


ALL-POLE 
DIGITAL  FILTER 
H(z) 


yk 


Figure  2.  Generation  of  All-Pole  Process 


The  digital  filter  is  characterized  by  a  p-th  order  autoregressive  relationship, 


with  transfer  function 


P 


z  ^„yk-n 

n=  0 


w. 


» 


H(z) 


1 


P 

Z 

n=  0 


P 


n 


-n 

z 


(79) 


(80) 


We  are  going  to  choose  digital  filter  coefficients  |(3n[  q  so  that  autoreSres~ 
sive  process  |yj<}  has  the  same  correlation  values  as  process  jx^}  ,  up 
through  order  p;  that  is,  we  will  set 

y .  y  *  =  R  for  ml  <  p.  (81) 

k  k-n  n  _ 


Then  the  spectrum  of  autoregressive  process  jyk|  ,  given  by 


G  (f)  =  G  (f) 
y  w 


H  (exp(i2  n  f  a)) 


_ a _ 

P 

£  Pn  exp(-  i2  -  fna) 
n=  0 


(82) 


oo 
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will  be  used  as  an  approximation  to  the  spectrum  of  .  The  spectral  rela¬ 

tion  in  (82)  holds  only  if  H(z)  is  stable;  that  is,  all  the  zeros  of  the  denominator 
of  (80)  must  lie  inside  O. 

In  order  to  evaluate  the  filter  coefficients  j  0n }  P  ,  we  notice  that 

w  y*  =  0  for  n  >  0  (83) 

k  k-n  ' 


since  noise  samples  jw^|  are  uncorrelated  (see  (78))  and  filter  H(z>  is  realiz¬ 
able  (see  (79)).  The  first  step  we  take  is  to  express  (79)  as 


y 


k 


P 


E 

n=  1 


(84) 


Then  using  (78)  and  (83), 


w 


kyk 


(85) 


Now  multiply  both  sides  of  (79)  by  y£_ f  and  average;  there  follows 


P 


y  0  r 

n  f-n 

n=  0 


JL_ 


0  <  &  <  p  , 


(86) 


upon  use  of  (81),  (83),  and  (85).  Now  if  we  let  /3n  =  >J3  ,  (86)  becomes  iden¬ 

tical  to  (46).  Therefore,  we  can  use  solution  (53)  to  obtain  for  the  filter  coeffi¬ 
cients 


c 

oo 

Pn  =  -j==  exp(-  i  e ) ,  0  <1  n  <  p  ,  (87) 

•J  oo 


where  0  is  an  arbitrary  real  constant. 
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Substitution  of  (87)  in  (82)  yields  the  autoregressive  spectrum  approximate: 
to  the  input  spectrum  as 


GX«  5  G/>  *  TT 


■  I  <  XT  • 


V  c  exp(-  L2  rr  fna) 

2-*  no 


This  is  identical  to  the  maximum  entropy  spectrum  (55)  and  the  linear  predic¬ 
tive  spectrum  (71).  The  discussion  surrounding  (76)  is  relevant  here  also. 

Substitution  of  (87)  in  digital  filter  (80)  yields 


H(z)  = 


fc  exp(if)) 
•J  oo 

~P 

E-n 

c  z 

nn 


This  is  stable  if  the  denominator  contains  ail  its  zeros  within  O;  that  is,  H(z)  is 
stable  if  and  only  if  B(z)  of  (56)  has  no  zeros  inside  O.  This  property  has 
already  been  shown  true  in  the  discussions  under  (3S),  (56),  and  (A-9). 

The  relationship  in  (86)  can  be  extended  to  i  =  p  -  1  with  the  result  that 


Y  R  .  =  0,  (91 

*—•  n  p-rl-n 
n=  0 

where  Rp-ri  is  now  interpreted  as  the  value  of  y^  y£-p-i>  and  was  never  speci¬ 
fied.  If  we  combine  (90)  with  the  last  p  equations  of  (86),  we  obtain 


Y  l 3  R  =  0,  1  <  Jt  <  p  +  1  . 
*-*  n  f-n  -  - 


In  order  for  this  set  of  p  +  1  linear  equations  to  possess  a  nonzero  solution  for 
|  (as  it  did  above),  we  must  have 
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This  can  be  solved*  for  Rp+i-  But  since  this  is  identical  with  reference  19, 
equation  (1) ,  we  see  that  the  all-pole  digital  filter  model  is  identical  to  choosing 
Rp+1  such  that 


is  maximized.  Additional  interpretations  of  (93)  in  terms  of  maximum  uncer¬ 
tainty  and  entropy  are  presented  in  references  20  and  14. 


*Of  course,  a  far  more  practical  method  is  given  by  (90)  and  (87),  and 
more  generally  by  (75). 
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4.  CORRELATION  UNKNOWN;  FINITE  DATA  SET 

In  this  section,  the  correlation  values  {R^l  are  unknown,  and  the  only 
information  available  about  the  random  process  x(t)  is  a  finite  set  of  N  samples 
Xj,  ...  xN  ,  from  which  we  remove  the  sample  mean.  From  these  N  samples, 
wedesirean  estimate  of  the  spectrum  Gx(f).  Yet  we  can  not  minimize  or  utilize 
any  ensemble  averages  as  was  done  in  sections  2  and  3,  since  we  have  onlv  a 
finite  segment  of  one  member  function  to  work  with. 

The  MESA  and  autoregressive  methods  of  subsections  3.1  and  3.3  are  not 
easily  directly  extended  to  the  case  of  unknown  correlation,  because  they  make 
explicit  use  of  this  correlation  information;  see  (31)  and  (81),  respectively.  A 
direct  extension  oi  these  two  methods  would  require  us  to  decide  on  the  form  of 
the  correlation  estimates  a  priori,  and  could  unnecessarily  restrict  the  quality 
of  the  spectral  estimate  we  finally  obtain.  The  linear  predictive  method  of  sec¬ 
tions  2  and  3.2,  on  the  otherhand,  requires  that  the  ensemble  average  magnitude- 
squarederror  be  replaced  by  some  estimating  quantity  that  can  be  readily  calculated 
from  the  available  data  {xn}  ^  ;  as  a  by-product,  we  may  get  estimates  of  the  cor¬ 
relation.  Several  candidate  processing  techniques  based  on  subsection  3.  2  will 
therefore  be  considered,  and  their  processing  equations  derived.  Also,  some 
of  the  results  of  subsection  3. 1  on  MESA  will  be  adapted  and  combined  with  the 
linear  predictive  approach  to  form  a  viabij  approach  to  spectral  estimation; 
this  technique  was  originally  presented  by  Burg  in  reference  21.  In  section  6, 
all  the  techniques  will  be  compared  by  means  of  simulation. 

4.1  YULE-WALKER  EQUATIONS 

We  begin  by  defining  in  this  subsection 

x,  =  0  for  k-"l,  k  >N  ,  (94) 

k 

since  these  samples  are  unavailable.  Taking  (58)  in  subsection  3.  2  as  a  guide,  we 
attempt  a  linear  prediction  according  to 

\  =  E  Vk-n  • 211  “  •  (95) 

n=  1 

where  the  choice  of  p  is  arbitrary  for  the  moment.  It  should  be  noticed  that 
although  x^  is  defined  for  all  k,  it  is  not  expected  to  have  a  good  chance  of  ac¬ 
curately  predicting  for  k<p  or  k  >N  +  2  since  some  zero  values  of  x^  have 
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been  utilized  in  those  regions,  according  to  (94).  Nevertheless,  we  define  an 
instantaneous  error 


«k  =  *k  -  *k  = 


P 


E 

n=  0 


ax,  ,  all  k 
n  K-n 


(96) 


it  is  expected  to  be  valid  or  meaningful,  however,  only  if  k  >p  +  l  and  k  <  N 
(error  must  utilize  a  zero  value  for  xjj+i).  Digital  filtering  operations 

(95)  and  (96)  are  stable  for  any  choice  of  coefficients  |aT1|  . 


Since  we  cannot  compute  an  ensemble  average  magnitude-squared  error 
now,  an  average  magnitude- squared  error  is  defined  for  the  available  data  of 
the  single  member  function  as 


F 


v-  *  1 

>  a  a  — 
4-  m  n  N 
m,n=  0 


£  Wk-n  ' 
k 


(97) 


where  ^  denotes  summation  over  all  nonzero  values  of  the  summand  |«j.|2  . 
regardless  of  how  meaningful  they  are.  The  normalizing  factor  1/N  is  some¬ 
what  arbitrary;  there  are  N+p  nonzero  terms  in  the  first  sum  in  (97> ,  but  only 
N-p  meaningful  terms. 


We  define,  for  all  n,m 


S 

n-m 


_L 

N 


L 

k 


x.  x 
k-m  k-n 


S*  , 
m-n 


in  which  case  (97)  yields 


F 


P 


m ,  n  =  0 


This  relation  uses  S.-  only  for  *  ■  p.  L-. 
coefficients  ja^}^,  we  compote 
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The  optimum  coefficients  |  a^[  P  are  therefore  solutions  of  the  p  linear  equations 
P 

£  S/-m  <*=  V1*  <10l> 

m=  0 


or 


P 


E 

m  =  1 


S  a 
l-m  m 


1  <j2  <  p. 


(102) 


These  are  the  Yule-Walker  equations  for  the  optimum  filter  coefficients .  The 
method  here  is  called  the  autocorrelation  method  in  reference  1.  (As  an  aside,  in 
analogy  to  subsections  2.2  and  3.2,  identically  the  same  equations  (102)  result 
when  Xk  is  predicted  on  the  basis  of  p  future  values,  rather  than  p  past  values 
as  was  done  here  in  (95);  see  (5)  and  (17)  et  seq.  and  (77)  et  seq.) 


The  minimum  value  of  average  error  F  is  obtained  by  substituting  (101)  in 
(97)  and  (99): 


III 

o 

p4 

£ 

l~k| 

2 

P 

E 

a* 

n 

P 

T  s  H 

*—•  n  n-m  m 

k 

n=  0 

m=  0 

P 

P 

=  ~ 

£ 

A 

s* 

m 

a 

m 

=  S 

0 

- 

T  S*  a  , 
m  m 

=  2 


P 

E 


S  3 
o  4— '  -m  m 

m=  0 


(103) 


where  we  have  employed  (98)  and  (101). 

Thepxpmatrix  lSf_m)  ?  on  the  left  side  of  (102)  has  the  form  of  a  legal 
correlation  matrix  in  that  it  is  Hermitian,  Toeplitz,  and  nonnegative  definite. 
The  last  property  follows  from 

P  ^  P 

^  S|-mam°t  ^  ama/  N  Xk-m*k-/ 

f,m  =  l  / ,  m=  1  k 


m=  l 


a  x. 
m  k-m 


>  0 


(104) 
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for  any  {«m[  l  •  Since  (104)  is  greater  than  zero  with  probability  one,  (102) 
will  possess  a  solution  with  probability  one. 

A  convenient  method  of  obtaining  this  solution  is  to  combine  (101)  and  (103) 
to  get 


P 

“  £  S«  m  =  Fo  >  °  *  *  *  P  • 

f-m  m  o  lo 

m  =  0 

Written  out  in  detail,  this  is 


S  S  •  e  •  S 

”  1  “ 

"f  ' 

o  -1  -p 

o 

st  s 

-X 

0 

1  o 

i 

•  • 

•  • 

• 

• 

• 

• 

• 

• 

• 

s  s 

-a 

0 

p  o_ 

P_ 

(105) 


(106) 


(The  (p+1)  x  (P+l)  matrix  in  (106)  is  nonnegative  definite,  as  a  simple  extension 
of  (104)  shows.)  But  (106)  is  identical  in  form  to  (A-3),  and  the  recursive  solu¬ 
tion  presented  in  (A-4)  through  (A-7)  applies  directly. 


The  spectral  estimate  we  adopt  follows  from  results  (68)  through  (71)  in  sub 
section  3.2  on  linear  predictive  filtering  for  known  correlation  values:  first, 
the  optimum  transfer  function  leading  from  jx^J  to  minimum-error  sequence 
|7k|  in  (96)  is 


P 

A(f)  *  £  a  exp(- i2*fnh) .  (107) 

n 

n«  0 

However,  we  have  a  problem  in  trying  to  accurately  estimate  the  average  mini¬ 
mum-error  power  that  would  be  used  in  the  numerator  of  the  assumed  white 
spectrum  for  the  error  in  (70).  Although  minimum  average  error  F0  of  (103) 
could  be  used,  it  is  not  recommended  because  not  all  the  error  terms  in  the 
sum  in  definition  (97)  are  meaningful.  Therefore,  because  of  our  inability  to 
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accurately  estimate  the  average  minimum-error  power  in  this  case,  we  shall 
adopt  as  our  spectral  estimate 


A 


£  a  exp(-i2»fnA) 
n-0  “ 


(108) 


This  is  tantamount  to  assuming  the  average  minimum-error  power  equal  to 
unity  (in  addition  to  assuming  the  minimum-error  spectrum  white) .  This  pro¬ 
cedure  also  eliminates  level  perturbations  in  the  spectral  estimate  (108)  due  to 
random  fluctuations  in  the  absolute  level  of  the  sample  set  {xnf  ;  that  is,  from 
(102)  and  (98),  it  is  seen  that  the  optimum  values  of  the  filter  coefficients, 

|  a^l  P  ,  would  be  the  same  if  {KXg}  ^  were  the  available  samples,  for  any  K. 
Therefore,  estimate  Gx(f)  in  (108)  is  also  independent  of  the  absolute  level  of 
the  available  samples.  The  choice  (108)  allows  for  convenient  comparisons  of 
the  spectral  estimates  obtained  by  the  various  methods  presented  here. 

As  an  alternative,  (108)  could  be  numerically  integrated  over  (--^  ,  ) , 

and  then  (108)  could  be  scaled  so  that  the  area  under  the  estimated  spectrum  is 

equal  to  the  sample  power,  r  jx_j 2,  if  desired. 

n*  1  “ 

The  implications  of  the  assumption  in  (108)  that  the  minimum-error  se¬ 
quence  has  a  white  spectrum  are  investigated  in  appendix  E .  It  is  shown  that 
the  sample  crosscorrelation  between  input  sequence  jx^}  and  minimum-error 
sequence  {7^}  ,  defined  for  the  available  data  of  the  single  member  function  as 

■>,  -  £  £  \  *i-i  • 111 1  •  (109) 

k 

is  assumed  to  satisfy 

Df  -  0,  l  <  i  ;  (110) 

that  is,  the  minimum-error  sequence  is  uncorrelated  (on  a  single  member  func¬ 
tion  basis)  with  all  the  past  input.  It  is  also  shown  that  the  quantities  Sf  de- 
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fined  in  (98)  (of  which  only  8t  for  p  were  used  in  (99)  et  seq.)  can  be 
estimated  for  1  >  p  +  1  according  to 

P 

si  -  £  \  8 #_n.i  ^p  + 1.  an) 

n«l 

This  relation,  (111),  which  may  not  be  true  for  the  quantities  St  actually  ob¬ 
tained  from  data  {xq}  ^  via  (98),  is  due  directly  to  the  assumption  that  the 
sample  spectrum  of  the  minimum -error  is  white;  see  appendix  E.  The  recur¬ 
sion  relation  (111)  is  stable,  according  to  appendix  D,  if 

P 

l  -  E  t  rn  aw) 

,  n 

n«l 

possesses  all  its  zeros  within  O.  But  since  matrix  in  (102)  has  the 

form  of  a  legal  correlation  matrix,  we  appeal  directly  to  reference  1,  page 
567,  to  state  that  this  property  does  indeed  hold.  Therefore,  recursion  (111) 
is  stable. 

It  is  worthwhile  noting  that  no  direct  estimation  of  unknown  correlation  val¬ 
ues  {Rk}  was  attempted  in  this  approach;  rather,  we  minimized  the  average 
error  defined  in  (97)  and  solved  directly  for  the  filter  coefficients  in  (102) . 
However,  if  we  rewrite  (105)  in  the  form 

P 

£  S#  P*  <U3> 

'  — m  m  o  «o 

m  ■  0 

and  compare  with  (C-3),  we  see  that  the  quantity  Sf  could  be  adopted  as  an  esti¬ 
mate  of  R|for  'ii  <  p,  that  is,  using  (98),  we  could  say 

R|  «  S|  -  £  £  xkx*_#  ,  I  i  i  <  P ,  (114) 

k 

(and  then  (111),  with  ft  replacing  S,  could  be  used  to  estimate  R,  for  1> p  «-  1, 
rather  than  (98) ).  This  is  in  fact  the  approach  adopted  by  some  authors;  see, 
for  example,  reference  2,  equation  (19).  However,  (114)  yields  biased  esti¬ 
mates  because 
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N-ljJI 


R,  ,  lil  <  N 


0,  otherwise 


(U5) 


It  Is  interesting  to  note  that  if  (114)  were  adopted  a  priori  as  estimates  of 
the  unknown  correlation  values  )  }  ,  then  the  MESA  and  autoregressive  ap¬ 

proaches  of  subsections  3.1  and  3.3  could  be  utilized  directly,  if  the  right  sides 
of  (31)  and  (81)  were  replaced  by  {R^}  .  The  spectral  estimates  would  then  be 
given  by  results  identical  to  (108),  except  for  a  scale  factor.  The  major  draw¬ 
back  of  this  approach  is  the  need  to  commit  oneself  to  a  specific  form  for  the 
correlation  estimates,  such  as  (114),  rather  than  letting  the  technique  itself 
yield  alternative  estimates.  The  specific  form  used  for  the  correlation  esti¬ 
mates  could  limit  the  quality  of  the  spectral  estimate  obtained;  this  contention 
is  proven  true  by  simulation  in  section  6. 


4.2  UNBIASED  VERSION  OF  YULE- 
WALKER  EQUATIONS 

One  method  of  obtaining  unbiased  estimates  of  the  correlation  values  {R,} 
is  to  define  estimators 


l<  £  Vk-(  ■  (HI.  £  Vk-(  ,or  #s  J  5p- 


N 

£ 

k  -  k-»*l 

A  * 

Of  course  R_#  »  R J  .  These  could  then  be  used  in  (102)  m  the  form 


(116) 


£  mTm‘  *'•  l(J^' 


(117) 


m»l 


to  solve  for  the  filter  coefficients  {  am|  ^  .  And  (108)  could  again  be  adopted  for 
the  spectral  estimate.  The  solution  for  the  coefficients  in  (117)  minimizes  no 
error  criterion;  it  merely  utilizes  unbiased  correlation  estimates.  The  dis¬ 
cussion  under  (115)  is  relevant  to  this  approach;  how  good  the  technique  is  will 
be  ascertained  in  section  6. 

The  matrix  (  Rf  .m)  f  of  estimated  correlation  values  on  the  left  side  of 
(117)  is  Hermitian  and  Toeplitz;  however,  it  is  not  necessarily  nonnegative 
definite.  (This  last  property  is  shown  by  considering  the  example  p>2,  N  *  3, 
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with  xi  -  2,  X2  ■  X3  *  3,  for  then  R0  *  22/3,  and  Rj_  *  15/2. )  The  recursive 
solution  of  appendix  A  could  again  be  applied  to  a  modified  form  of  (117) ;  see 
(105)  and  (106).  If  the  recursive  technique  in  (111)  were  utilized  to  extrapolate 
R,  according  to 


P 

R,  s  I  ^n^-n*  1  -p  +  1  (ll8) 

n*  1 

and  (116),  it  need  not  be  stable  unles  (Rf  _m]  P  is  nonnegative  definite.  Even  if 
(118)  were  unstable,  (108)  could  still  be  used  as  a  spectral  estimate  of  Gx(f); 
there  would,  however,  be  a  greater  tendency  of  some  pole-pairs  of  (108)  to 
drift  close  to  the  unit  circle,  O,  in  the  z -plane  and  give  rise  to  spurious  large 
peaks  in  the  spectral  estimate .  This  tendency  is  reduced  for  stable  recursions 
(118),  that  is,  if  (112)  possesses  all  its  zeros  within  O. 

4.3  LEAST-SQUARES  ESTIMATES  OF 
BOX  AND  JENKINS 

In  reference  16,  appendix  A7.5,  a  likelihood  function  approach  to  estima¬ 
tion  of  the  coefficients  in  an  all -pole  (that  is,  autoregressive)  filter  model  for 
generation  of  the  process  jx^ }  is  considered.  The  end  result  (in  our  notation) 
is  given  in  (A7.5.7)  for  real  data  by 


N-i-j 

S.  a  ~  D.  ,  .  •  —  £  x  x  ,, .  0  <  i,  j  <  p 

ij  N  i-l,ri  N  ~  He  rk 

k*  i 

and  in  (A7.5.I5)  by 


52  S?  *  S  ,  1  <  i  <  p  . 
i)  j  io  K 


J-l 


(119) 


(120) 


This  constitutes  p  linear  equations  in  the  p  unknowns  J  .  The  matrix  iSijl  ^ 
occurring  in  (120)  is  symmetric,  not  necessarily  Toepliu,  and  not  necessarily 
nonnegative  definite.  (The  last  property  is  shown  by  considering  the  example 
N  »  5,  p  ■  2,  with  X2  *  x3  *  X4  »  1,  for  then  S^  *  3/5,  Sj.2  *  S21  *  2/5,  S22  * 
1/5,  and  the  determinant  is  -  1/25.)  The  quantities  |S^|  also  yield  biased 
estimates  of  {R^_j(  .  because 
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S  =  N  ~  j  R.  .  .  (121) 

ij  N  i-j 

Nevertheless  we  will  adopt  (108)  for  our  spectral  estimate  in  this  case.  The 
fact  that  we  encounter  a  non-Toeplitz  matrix  in  (120)  disallows  the  use  of  the 
recursive  technique  for  solution  in  appendix  A. 

If  the  solution  to  (120)  is  substituted  in  (112),  the  zeros  need  not  all  lie  in¬ 
side  O.  Therefore,  there  would  be  a  greater  tendency  for  some  pole-pairs  of 
(108)  to  drift  close  to  O  than  when  all  the  zeros  must  lie  inside  O,  as  for  sub¬ 
section  4.1. 

4.4  APPROXIMATE  MAXIMUM  LIKELIHOOD 
ESTIMATES  OF  BOX  AND  JENKINS 


This  technique  is  a  slight  modification  of  the  previous  one  in  subsection 
4.3.  Namely,  in  reference  16,  under  (A?. 5. 18),  the  coefficients  are  solutions 
of 


£  Vt"  sio'  lsl-p' 
j-1 


(122) 


where  (see  (119) ) 


i  i  N'i'i 

V  tn77Dt-i.i-i  *  n^tt?  £  VkV  0Si')Sp- 

k-  1 


(123) 


The  matrix  l  St  j  ]  P  occurring  in  (122)  is  symmetric,  not  necessarily  Toeplitz, 
and  not  necessarily  nonnegative  definite.  (The  last  property  is  shown  by  con¬ 
sidering  the  example  N  ■  5,  p  *  2,  with  X2  *  2,  X3  *  1,  X4  *  2,  for  then  Sj^  *  3, 
S12  *  S2i  *  2,  S22  *  1,  and  the  determinant  of  these  coefficients  is  -1).  The 
quantities  {s^j  yield  unbiased  estimates  of  {Rj_j(  ;  however,  every  element  in 
a  particular  diagonal  can  be  different,  even  thougn  they  are  estimating  the  same 
quantity.  Also,  the  number  of  terms  (in  the  sum  in  (123) )  along  a  particular 
diagonal  varies  with  the  position  of  the  element,  thereby  yielding  differing  de¬ 
grees  of  stability.  Equation  (108)  can  be  used  with  (122)  to  obtain  the  spectral 
estimate.  Recursive  solution  of  (122)  is  not  allowed  because  of  the  non-Toeplitz 
character  of  the  matrix  ( Sij  J  ^  .  The  comments  at  the  end  ol  subsection  4.3 
are  relevant  here  also. 
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4.5  PREDICTION  USING  VALID  ERROR 
POINTS 


The  method  of  subsection  4. 1  utilized  an  average  error  measure  defined  over 
all  nonzero  error  terms;  see  (97).  However,  as  noted  under  (96),  instantaneous 
error  is  meaningful  only  if  k  >  p  ->-1  and  k  <  N.  Here  we  define  an  average 
magnitude-squared  error  by  summing  only  over  the  set  of  valid  error  points: 


F 


1 

N-p 


N 


Z 

k=p  + 1 


(124) 


There  are  N-p  terms  in  this  sum .  This  procedure  is  tantamount  to  not  running 
off  the  edges  of  the  available  data  |xn}^  .  Employing  (96),  (124)  can  be  writ¬ 
ten  as 


F  *  V*  a  a*  S  , 
m  n  nm 


m,n=  0 


(125) 


where 


N 


nm 


=  rri—  T  jlx*  =  S*  . 
N-p  K-m  k-n  mn 


(126) 


k*p  +1 


This  quantity  always  contains  N-p  terms  for  0  <  n,  m  <  p .  In  order  to 
minimize  F,  we  compute 


hp  P 

IS’  E  S,mV  P- 

f  m  =  0 


(127) 


The  optimum  predictive  coefficients  are  therefore  solutions  of 
P 

£  S  a  =  °.  l<i<P  (a  =a  =*-D ,  (128) 

„  /mm  oo 

m*0 
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or 


P 

£ 


m  =  l 


Sfm*m 


1  <JL  <p. 


(129) 


The  method  here  is  called  the  covariance  method  in  reference  1 . 


The  minimum  value  of  the  average  error  F  is  obtained  by  substituting  (128) 
in  (124)  and  (125): 


F 

o 


1 

N-p 


N 


£ 

k=  p  + 1 


P 


P 


£ 


S  a 
nm  m 


P 


E 


S  a 
om  m 


P 


=  -  £ 
m=  0 


S*  a  =  S 
mo  m  oo 


P 


£ 

m=  1 


S* 

mo 


(130) 


where  we  have  used  (126)  and  (128). 

The  p  x  p  matrix  [Sfm]  ^  on  the  left  side  of  (129)  is  a  legal  correlation 
matrix  in  that  it  is  Hermitian  and  nonnegative  definite.  The  last  property  fol¬ 
lows  from 


/  ,m=  1 


s,  “  <*; 

im  m  t 


P 

£ 

/ ,  m=  1 


k=p  +  l 


x,  x. 
k-m  k-/ 


1 

N-p 


N 


£ 

k=p  +  l 


P 


m  =  1 


(131) 


for  any  Since  [ S/m ]  is  not  necessarily  Toeplitz,  however,  the  re¬ 

cursive  solution  in  appendix  A  is  not  applicable.  Numerical  computation  of 
[Sjm]  is  eased  by  taking  advantage  of  a  recursive  relation  between  m+1 


The  spectral  estimate  we  adopt  is  given  by  (108).  However,  note  that  we 
could,  if  desired,  get  an  estimate  here  of  the  average  minimum  error  power 
E0,  used  in  (70),  according  to 
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(132) 


This  quantity  is  meaningful  because  (130)  involves  only  the  valid  error  terms. 

Equation  (129)  is  similar  to,  but  not  identical  with,  the  form  of  (117).  The 
quantities  { snxxif  *  defined  in  (126),  yield  unbiased  estimates  of  |Rn_m}  ;  how¬ 
ever,  every  element  in  a  particular  diagonal  can  be  different,  even  though  they 
are  estimating  the  same  quantity. 

If  the  solution  to  (129)  is  substituted  in  (112),  the  zeros  need  not  lie  inside 
O,  despite  the  nonnegative  definite  property  demonstrated  in  (131).  (The  ex¬ 
ample 


p  =  1,  N  =  2,  yields  a  =  *2/xi  (133) 

and  gives  a  zero  location  of  (112)  which  can  lie  anywhere  in  the  z-plane.) 
Therefore,  the  comments  at  the  end  of  subsection  4.3  are  relevant  here  also. 

4.6  FORWARD  AND  BACKWARD  PRE¬ 
DICTION  USING  VALID  ERROR  POINTS 

It  was  noted  in  subsections  2.2,  3.2,  and  4.1  that  "prediction"  based  on 
future  values  of  the  input  {x^}  yielded  an  equivalent  spectral  estimate  to  that 
obtained  by  prediction  based  on  past  values.  Here  we  combine  the  two  tech¬ 
niques  .  The  forward-predicted  value  of  is . 

P 

an  V„’»  +  1£ksN-  <134> 

n=  1 

where  we  limit  k  to  the  range  Ip  +  1,  NJ ,  in  anticipation  of  the  fact  that  we  can 
only  measure  valid  errors  in  that  range;  see  (96)  et  seq.  The  backward-pre¬ 
dicted  value  of  x^  is 


P 

V  £  a'„  Vn-  kiN-p'  <135’ 

n=  1 

where  we  again  limit  the  range  of  k.  (See,  for  example,  (15),  (22),  and  (77).) 
The  forward  and  backward  errors  are,  respectively, 
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=  \-x=T  ax,  ,  p  +  1  <  k  <  N , 
k  k  ^  n  k-n 
n=  0 


(136) 


v  v 

*k 


- xk=  £  is"£n-p. 

n=  0 


where  a  =  -  1. 
o 


An  overall  average  magnitude- squared  error  is  defined  as 


/  N  N-p 

'■iara(.E 


vk=p  +  l 


k=  1 


-  1 2' 


=  T  a  a*  S  ,  (137) 
*-*  m  n  nm 
m,n=  0 


where,  in  this  subsection, 


nm  “  2(N 


/  N  N-p  v 

N-p)  (  ^  Xk-m\-n  +  ^  Xk+nXk+m)  * 

^  \k=p+l  k= 1  / 


(138) 


This  quantity  always  contains  2  (N-p)  terms  for  0  <  n,  m  <  p .  Two  useful  prop¬ 


erties  of  S  are  immediately  available: 


S  =  S*  ,  S  =  S*  . 

mn  nm  p-n,  p-m  nm 


(139) 


These  properties,  plus  a  recursive  relation  relating  Sn+i,  m+i  and  Sj^,  ease 
the  numerical  computation  of  matrix  [  S^]  . 

We  minimize  F  by  choice  of  jamJ  ^  ,  getting  (see  (127)  -  (129) ) 

P 


VS  a  =  S  ,  1  £  n  £p 
nm  m  no 

m=  1 


(140) 


The  minimum  value  of  F  is  (see  (130) ) 
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P 

F  =  S  -  T  s*  a  . 
o  oo  mo  m 

m=  1 


(141) 


The  method  here  is  an  extended  version  of  the  covariance  approach  in  refer¬ 
ence  1. 


The  matrix  [Sfm]  ^  is  Hermitian  and  nonnegative  definite: 


P 

L 

!,  m  =  1 


C  *  _  _ • 

I  m  am  af  2(N-p)J 


/  N 

P 

2  N~P 

P 

2\ 

{  L 

£  "mVm 

+  E 

^  amXk+m 

)>0 

\k=  p  + 1 

m=  1 

k=  1 

m  =  1 

7142) 

for  any  |am| ^  .  However,  this  matrix  is  not  necessarily  Toeplitz;  therefore, 
we  cannot  apply  the  recursive  solution  of  appendix  A. 


The  spectral  estimate  we  adopt  is  obtained  by  substituting  the  solution  of 
(140)  in  (108).  An  estimate  of  the  average  minimum  error  power  E0,  used  in 
(70) ,  is  available  here  according  to 


(143) 


if  desired,  where  FQ  is  given  by  (141).  This  is  meaningful  because  (137)  util¬ 
ized  only  the  valid  error  terms. 

In  analogy  to  (126),  the  quantities  |Snmf  in  (138)  yield  unbiased  estimates 
of  |Rn-mf  •  Nevertheless,  if  the  solution  to  (140)  is  substituted  in  (112),  the 
zeros  need  not  lie  within  O,  despite  the  nonnegative  definite  property  shown  in 
(142).  For  p  =  1,  we  find 


a.  = 


Vi  +X3X2  +  +Vn-1 


112 
2  X1  +  X2 


2  I  2  2 

+  |x3  ...  +  xN2  +  Xy.j. 


2  1 
+  2  XN 


And  since 


> 

2  xk-] 

2  1 

L  +  2  3 

J2 

<1, 


2  * 

(144) 

(145) 
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it  follows  that 


1^1  <  1 .  (146) 

So  for  p  =  1,  the  zero  of  (112)  must  lie  within  O,  (unless  x^  =  A  exp(ikB)  for  all 
k,  in  which  case  it  lies  on  O).  However  for  p  =  2,  N  =  3,  and  real  data,  the 

r 2 -  X1  +  X3 

zeros  of  (112)  lie  at  r  +*Jv  -  1,  where  r  = — - - .  So  if  Irl  <1,  both  zeros 

zx2 

lies  on  O,  whereas  if  Irl  >1,  one  zero  lies  outside  O.  Therefore,  the  com¬ 
ments  at  the  end  of  subsection  4.3  are  relevant  here  also. 


4.7  BURG  TECHNIQUE 


The  key  to  this  technique,  first  presented  in  reference  21,  is  the  observa¬ 
tion  from  equation  (A-6)  in  appendix  A  that  if  the  particular  p-th  order  coeffi¬ 
cient  a^  can  be  evaluated,  the  rest  of  the  p-th  order  predictive  filter  coeffi¬ 
cients,13  a  ^  ,  1  <  k  <  p  -  1,  could  be  evaluated  from  (p  -  l)-th  order 
coefficients .  This  relation  (A-6)  holds  true  for  the  solution  of  normal  equations 
(A-3)  even  if  jR^}  are  replaced  by  estimated  values.  Explicitly,  if  estimates 
ftp  . . . ,  Kp_i,  and  a  arc  considered  known  in  the  matrix  equation 


A 

R-i 

A 

.  •  •  R 

-P 

_ 1 

~p(pT 

A 

R 

o 

-r 

ii 

0 

A 

R 

o  _ 

• 

• 

-a(p) 
L  p  _ 

i 

“  °  i 

9 


(147) 


then  we  have  p  +  1  linear  equations 
P^.  (Notice  that  whereas  Rp  was 
tion  is  reversed  here  for  these  two 
P  >1,  by 


,(P) 


>> 

Vl’ 


V 


in  the  p  +  1  unknowns  aT 

(p)  1 

known  and  ap  unknown  in  (A-3),  the  situa- 

variables.)  The  solutions  are  given,  for 


(P)  _  a(P-l) 
k  "  *k 


<P>  a(P-D 
ap-k 


k  =  1,  2, 


p  -  1  (no  terms  if  p=l)  (148) 
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R 

P 


(149) 


p(P)  =  p(P-D 


(150) 


The  quantities  jpjJ  in  (149)  are  the  estimated  normalized  correlation  coeffi¬ 
cients  {Rk/R0|  •  The  recursion  (150)  is  started  with 


P 


N  2 

£  N  • 

n=  1 


(151) 


which  is  the  sample  power  of  the  available  samples.  A  method  of  evaluating 
aP^  for  p  >1  is  treated  below. 

The  method  presented  here  is  a  combination  of  references  21  and  7.  It 
begins  by  defining  zero-th  order  forward  and  backward  sequences  according  to 

f(0)  =  x  ,  b(0)  =  x  ,  1<  n  <N  .  (152) 

n  n  n  n 

The  p-th  order  forward  and  backward  sequences  (residuals)  for  p  >  1  are  de¬ 
fined  according  to 


{(P)  =  {(P-D 


n  n 


(P-1) 

n-1 


f(P-U 

n 


for  p  +  1  <  n  <  N  . 


(153) 


(These  can  be  interpreted  as  one-step  forward  and  backward  prediction  errors.) 
A  chain  interpretation  of  (153)  is  presented  in  figure  3.  (From  the  known 
correlation  results  in  subsection  3.  2,  if  we  define 


f®>  =  x  -  £  af  x  „  ,  b'»>  =  „  -£a®,)  x  v, 

n  n  £^1  k  n-k  n  n-p  k  n-p+k 

(P) 

we  find  that  figure  3  results,  with  g^  replaced  by  a^  . ) 
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Figure  3.  Chain  Interpretation  of  Burg  Technique 


The  average  magnitude-squared  value  of  the  p-th  order  forward  and  back 
ward  sequences  is 


F<p> ,  *  E  ( t® 

2(N-p)  .  \  n 


N 

E 

n=p  +  I 


+  b 


(P) 


*  £,  /|r(P-l)  „  h(P«l) 

2 (N-p)  E  \ In  "  gp  bn-l 


n=p  +  l 


2  ♦  lb4’-1'  -  g-  f(p-1»'2 

I  n-1  p  n 


,  P  >  1 . 

(154) 


We  wish  to  minimize  this  average  power  at  the  p-th  stage  by  choice  of  cross¬ 
gain  gp .  We  find  the  optimum  choice  to  be 


N 


V 


2  E  f(p‘1)  b®-1'* 

,  n  n-1  XT  ,  . 

n=p  +  1 _ _  Num(p) 


N 

E 

n  =  p  +  1 


r(p-D 


n 


+  b 


b(p-l)  \ 

n-1  / 


Den(p) 


.  P  >  1 . 


(155) 


43 


TR  5303 


When  (155)  is  substituted  in  (153),  the  results  are  called  the  residuals.  The 
minimum  value  of  the  residual  power  at  the  p-th  stage  is  obtained  by  substitu¬ 
ting  (155)  in  (154)  and  is  expressible  as 


Den(p) 
2(N-p)  ’ 


p  >  1 . 


(156) 


The  quantities  necessary  for  this  evaluation  are  available  when  (155)  is  evalu¬ 
ated.  The  value  of  (156)  will  never  be  smaller  than  (141),  since  the  procedure 
here  is  a  step-by-step  procedure,  not  a  simultaneous  procedure  as  used  in 
subsection  4.  6. 


An  immediate  recursion  for  the  transfer  functions  of  the  p-th  stage  in  fig¬ 
ure  3  is 


3  WW  -  3<p-l,(I)  -  v-l^l)W 


(157) 


with  starting  values 

3(0)(z)  -  I3(0)(z)  =  1.  (158) 


If  we  let  transfer  functions 


E  afz-k, 

k=  1 


BW<W  - 


p-i 


k=  0 


(p)*  -k  -p 
a  z  +z 
p-k 


-p  (p)*  -l 

z  (z  ) , 


(159) 


the  solution  is 


z? '  -  gn  .  P  >  1 .  (160) 

P  P 

with  the  lower  order  coefficients  given  by  (148).  Thus,  the  only  remaining 
quantity,  a^  ,  that  was  necessary  for  solution  of  (147)  -  (150)  is  given  by  (160) 
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and  (155),  along  with  (152)  and  (153).  To  the  three  lowest  orders,  the  solutions 
are  given  by 


p.o.  p«-*0.*£  IM 

n»  1 


(161) 


p  *  1,  a 


(1) 


R  .'W 

o  1 


p(1)  -  P<°>  (l  -  la^l2) 


(162) 


P  *  2,  a, 


(2) 


*2 


a<2)  .  a<l>  a<2>  Jl>’ 

*1  “l  "  *2  *1 


n  p  _(2)  a  (2) 

R2  R1  *1  *  Ro  *2 


p^.p'1’  2) 


(163) 


It  will  be  observed  that  for  p  *  1,  is  identical  to  (144);  in  fact,  the  proce¬ 
dures  are  identical  in  this  case.  It  should  also  be  noted  that  at  each  stage,  an 
estimate,  Rp,  of  the  true  correlation  value  Rp  becomes  available  via  (149),  and 
is  unchanged  by  the  addition  of  any  further  stages  (larger  p) . 


It  was  demonstrated  in  (A-  9)  that  the  magnitude  of  a^  was  bounded  by 
unity  if  the  known  correlation  matrix  R  was  nonnegative  definite.  The  same 
property, 


(164) 


(p) 

is  true  here  in  the  case  of  unknown  correlation  when  a^  is  determined  by  (160) 
and  (155);  see  appendix  F.  This  is  sufficient  to  show  that  all  zeros  of  (A- 10) 
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lie  inside  O;  see  reference  11,  for  example.  Therefore,  the  recursion  (149) 
can  be  used  in  the  form 

R,  -  E  Vk  «^>>.  1  >  P  +  1  ,  (165) 

k-1 

to  extrapolate  the  estimated  correlation  values  beyond  p-th  order,  with  the  p-th 
order  coefficients  {ajjjP^}  ^  ,  and  is  guaranteed  to  be  stable.  Division  of  (165)  by 
R0  yields  the  normalized  correlation  coefficients.  Recursion  (165)  is  similar 
in  form  to  those  encountered  in  (75),  (111),  and  (118). 

The  quantity  that  results  as  the  solution  (150)  of  matrix  equation  (1*7) 
is  not  the  minimum  average  magnitude-squared  error  as  it  was  for  known  cor¬ 
relation;  see  (A-3),  (A-7),  and  (67).  In  fact,  p(P)  has  no  direct  physical  sig¬ 
nificance;  it  is  merely  the  variable  left  over  in  that  position  in  the  normal  equa¬ 
tions  (147)  when  modified  from  the  case  of  known  correlation  values,  (A-3). 

(P) 

Rather,  Fq  in  (154)  and  (156)  is  the  minimum  average  magnitude- squared 
error  of  the  forward  and  backward  residuals,  (153),  of  the  available  data. 

Thus,  as  far  as  picking  an  "optimum"  value  of  p  at  which  to  terminate  the  re¬ 
cursion  in  (147)  -  (150)  is  concerned,  the  latter  quantity  has  more  physical  sig¬ 
nificance.  However,  the  two  quantities  are  very  close  to  each  other  for  no  bad 
data  points,  especially  for  N-p  large;  see  appendix  G.  Both  quantities  are 
readily  calculated  at  any  stage  via  (150)  and  (156),  respectively. 

The  transfer  functions  from  input  x  to  the  p-th  order  residuals  are  given  in 
(159).  Therefore,  the  spectra  of  the  residuals  are  given  by 

G^ff)  =  G^f)  =  |3(P)(exp(i2*fA>)|2  G^f)  .  (166) 

Now  if  the  chain  in  figure  3  has  been  carried  to  the  stage  where  further  values 
of  cross-gain  gp  would  be  substantially  zero,  then  the  residuals  are  approxi¬ 
mately  white.  Therefore,  an  estimate  of  the  input  spectrum  is  available  from 
(166)  and  (159)  according  to 


Gx® 


-  £  ajj^  exp(-i2irfk^) 
k=  1 


f  <  -=- 


1_ 

2-i 


(167) 
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where  the  residual  power  haa  been  aet  at  unity;  see  the  discussion  under  (108). 
Two  alternatives  to  this  scale  factor  are  discussed  in  appendix  H;  namely,  it  is 
shown  that  p(P)  and  are  both  meaningful  scale  factors  that  could  be  applied 
to  (167). 


The  estimated  correlation  values  in  (149)  are  generally  biased.  This  may 
be  anticipated  from  the  complicated  forms  of  (149),  (148),  and  (155),  si^ce 
additional  statistics  than  simply  xi^x;,  need  to  be  known  in  order  that  Rp  be 
capable  of  evaluation;  that  is,  Rp  depends  on  much  more  than  just  xic-p**  •  for 
the  Burg  method.  This  biasedness  may  be  proven  for  a  simple  example  with 
p  *  1,  N  =  3.  (R0  in  (151)  is  unbiased;  and  for  p  *  1,  N  ■  2,  we  find  R}  *  x2xj  , 
which  is  unbiased.)  For  real  data,  with  random  variables  {xj^}  being  zero- 
mean  unit-variance  Gaussian,  and  *2*1 »  'I  *3*1  -  we  find  (in 
~  1  12  -  2J3  1  ** 

appendix  I)  that  R,  *  +  —  - r - *  *  —  (.9484).  The  bias  is  slight  but  non- 

V2  9 

zero. 


In  summary,  the  Burg  algorithm  for  data  processing  consists  of  initializa¬ 
tion  (152);  followed  by  the  cross-gain  calculation  in  (155);  filter  coefficients  via 
(160)  and  (148);  and  normalized  correlation  coefficients  (149)  (if  desired) 
at  every  stage.  The  update  required  at  each  stage  is  given  by  (153),  and 
the  extrapolated  normalized  correlation  coefficients  at  any  stage  are  available 
from  (165),upon  division  by 

4.8  SUMMARY  OF  PROPERTIES  OF 
TECHNIQUES 

The  solution  for  the  filter  coefficients  in  the  techniques  considered  above 
can  be  put  in  the  form 


*(-  1)  »  F  t  . 
o 


(168) 


The  properties  of  the  estimated  correlation  matrix  a  (if  desired)  are  tabulated 
in  table  1.  (Actually,  several  of  the  "No"  entries  should  be  "Not  Necessarily.”) 

It  will  be  seen  that  none  of  the  techniques  possesses  a  "Yes"  for  all  the 
properties.  The  Yule-Walker  and  Burg  techniques  possess  everything  but  the 
unbiased  property;  however,  the  unbiased  property,  per  se,  of  the  correlation 
estimates  is  not  necessarily  a  desirable  feature  for  spectral  estimation,  as  will 
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Table  1.  Properties  of  Estim atari  Correlation  Matrices 


Correlation 

Technique 

Estimates 

Noon eg- 

stive 

Stable 

Toeplitz 

Definite 

Recursion 

Yule- Walker 

(114) 

No 

Yes 

Yes 

Yes 

Yes 

Unbiased 

Yule-Walker 

(116) 

Yes 

Yes 

Yes 

No 

No 

Least  Squares 
of  Box  and 
Jenkins 

(119) 

No 

Yes 

No 

No 

No 

Approximate 

maximum 

likelihood  of 
Box  and 
Jenkins 

(123) 

Yes 

Yes 

No 

No 

1 

No 

Prediction 

(126) 

Yes 

Yes 

No 

Yes 

No 

Forward  and 
Backward 

Prediction 

l 

(138) 

Yes 

Yes 

No 

Yes 

No 

Burg 

1 

(149) 

No 

Yes 

Yes 

Yes 

Yes 

be  seen  by  later  simulation  results.  On  the  other  hand,  simultaneous  satisfac 
tion  of  the  three  properties  of  Hermitian,  Toeplitz,  and  nonnegative  definite 
guarantees  that  a  stable  recursion  and  nonspiky  spectral  estimates  result,  see 
reference  1,  page  567. 
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5.  CORRELATION  UNKNOWN:  FINITE  DATA  SET 
WITH  BAD  DATA  POINTS 

In  some  application*,  some  data  values  can  be  bad  as  a  result  of  malfunc¬ 
tioning  equipment  or  human  errors  in  reading  or  recording,  for  example.  Also, 
some  data  values  can  be  missing  as  a  result  of  equipment  being  inadvertently 
or  intermittently  turned  off  for  calibration  purposes,  for  example;  or  some  sec¬ 
tions  of  data  can  be  contaminated  by  strong  burst-like  noise  and  be  virtually 
useless  in  those  sections.  All  of  these  problems  can  be  characterized  mathe¬ 
matically  by  saying  that  of  the  available  data  set  {xq}  ^  ,  the  values  Xq  for  the 
distinct  integers 

n  -  M2,  ...  Mfi  (169) 

are  known  to  be  bad  (or  missing) .  The  B  bad  locations  { M j }  ®  are  presumed 
to  be  known.  The  bad  data  points  can  be  regularly  spaced,  or  randomly  spaced, 
or  a  combination,  depending  on  the  application,  it  will  make  no  difference  to 
the  techniques  to  be  developed  here. 

In  this  section,  we  wish  to  estimate  the  input  spectrum  despite  the  presence 
of  known  bad  points.  The  last  two  methods  in  subsections  4.6  and  4.7  will  be 
extended  to  cover  this  case .  The  reason  we  do  not  extend  the  other  methods  in 
section  4  will  become  clear  when  we  compare  the  various  techniques  by  simu¬ 
lation  in  section  6. 

5.1  FORWARD  AND  BACKWARD  PRE¬ 
DICTION  USING  VALID  ERROR  POINTS 

The  method  to  be  presented  here  is  very  similar  to  that  given  earlier  in 
subsection  4.6;  accordingly  the  treatment  will  be  briefer.  For  convenience  and 
to  enable  a  better  estimation  of  the  true  spectrum  with  a  limited  number,  p,  of 
parameters,  we  subtract  the  sample  mean  of  the  N-B  good  data  points  so  that 

N 

N-B  £  xn  “  0  ’  (170A) 

n*  1 

where  $  denotes  that  we  skip  those  values  of  n  in  the  set  (169);  that  is,  we 
simply  ignore  the  bad  data  points  —  this  is,  in  fact,  the  main  theme  of  the 
methods  to  be  presented.  We  attempt  no  interpolation  on  the  bad  points,  nor  do 
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we  set  them  equal  to  zero  or  the  sample  mean.  We  also  scale  the  good  points 
so  that  the  sample  variance  is  unity: 


1 

N-B-l 


=  1. 


(I70B) 


This  helps  avoid  overflow  and  underflow  problems  in  the  numerical  manipulation 
of  large  arrays  encountered  for  large  p. 

A  forward  prediction  of  x^  is  afforded  by 

P 

L  a  xkn*P  +  1^k^N*  (171) 

n=  1 

provided  that  k-1,  k-2,  ....  k-p  4  M].,  M2,  ....  MB.  Then  a  valid  forward 
error  can  be  defined  as 


W*k"  £  anVn  V-l>,P  +  l<k<N,  (172) 

n=  0 

provided  that  k,  k-1,  . . . ,  k-p  ^  Mj_,  M2,  . . . ,  Mg  ;  that  is,  /t^  is  defined  for 
p  + 1  <  k  <  N  except  for  k  in  the  set  of  integers 


I  : 
P 


Ml- 

M1  +  lf 

•  •  •  t 

Ml 

+  P  \ 

M2, 

• 

m2  +  1’ 

•  •  •  t 

M2 

+  P 

• 

• 

V 

mb+1* 

•  •  •  f 

mb 

+  P 

(173) 


If  any  numbers  in  set  Ip  are  <p  +  l  or  >  N,  they  are  not  encountered  in  the  error 
definition  (172).  Let  Bp  denote  the  number  of  distinct  integers  in  Ip  which  are 
>  p  +  1  and  <N;  this  is  the  number  of  gaps  (bad  points)  in  the  error  sequence 
(172). 
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We  now  define  an  average  forward  error  over  the  valid  error  points  as 


A 

F  = 


1 

N-p-B 

P 


N 

E 


k=p  +1 

k$‘P 


(174) 


where  $  denotes  "not  contained  in, "  and  N  -  p  -  Bp  is  the  number  of  terms  in  the 
sum.  Substituting  (172)  in  (174),  we  obtain 


F  = 


P 

E 

m,n=  0 


a  a 
m  n 


N-p-B 


N 

E 


P  k=p  +  l 


VrnVn* 


(175) 


A  backward  prediction  of  x^  is  available  as 

P 

V  E  >;*k+n.  l<k<N-p.  (176) 

n=  1 


provided  that  k+l,  k+2,  ....  k+p^M^,M2,  ....  Mg.  And  a  backward  error 

P 

‘k  3  \  ~  Xk  =  S  an  \+n  (a0=_1)  *  1  ^  k  -  N  ~  P  >  (177) 

n=  0 

is  available  if  k,  k+l,  . . . ,  k+p  ^  Mi,  M2,  ....  Mg.  Letting  i  =  k+p  in  (177), 
we  can  write 


« 

t 


■P 


P 

E 


n=  0 


a* 

n 


x 


f-p+n’ 


p  +  1  <  j2  <  N, 


(178) 


if  J2  ia  not  contained  in  the  set  Ip  defined  in  (173).  The  we  can  define  an  average 
backward  error  over  the  valid  error  points  as 
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V 

F 


N 


N-p-B 


P  t 
I 


P 

r'  *  1 

“  am  an  N-p-B 
m,n=  0  p 


N 

£ 


/=p  +  l 


l-p+n 


x 


* 

l-p+m  * 


(179) 


(180) 


where  we  have  substituted  (178). 

We  are  now  in  a  position  to  define  an  overall  average  error  as 

1  A  V 

F  *  |  (F  +  F)  * 
where,  from  (175)  and  (180), 

N 

s  =  1  y 

nm  2  (N  -  p  -  B  ) 

P  k=p  + 1 

k/lp 

It  should  be  noticed  that  (182)  does  not  tell  us  merely  to  sum  over  the  "good" 
products,  but  rather  to  exclude  set  Ip.  The  number  of  terms  in  the  sum  (182) 
is  the  same  for  0<n,  msp  and  Is  N-p-Bp.  (For  no  bad  points,  (182)  reduces  to 
(138).)  Two  useful  properties  of  Snm  are 

S  =  S*  ,  S  =  S*  .  (183) 

mn  nm  p-n ,  p-m  nm 

The  quantity  Snm  is  an  unbiased  estimate  of  Rn_m;  however,  the  presence  of 
bad  points  will  increase  the  variance  of  Snm;  see  reference  5.  The  matrix 
l  Snm)  ^  is  Hermitian  and  nonnegative  definite. 

The  optimum  predictive  filter  coefficients  ^  are  obtained  by  mini¬ 

mizing  (181): 


y  a  a*  S 
^  m  n  nm 
m,n*0 


(181) 


(x,  x?  +  x  x*  \.  (182) 

\  k-m  K-n  k-p-t-n  k-p-t-m ) 
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Sm 


sio.  1<J  <P. 


The  minimum  average  error  is 


F  =  S 

O  00 


-Vs* 

n 


m*  1 


a 

mo  m 


And  since  the  sample  variance  of  the  good  data  points  was  set  equal  to  unity  in 
(170B),  (185)  is  a  relative  error  measure  that  can  be  used  to  decide  what  value 
of  p  should  be  used  in  (171)  and  (176);  see  reference  1,  equations  (41)  and  (89) 
et  seq.  The  spectral  estimate  we  adopt  is  obtained  by  substituting  the  solution 
of  (184)  into  (108),  as  usual.  The  quantity  F0  in  (185)  could  be  used  as  a  scale 
factor,  if  desired,  according  to 

E  =  F  .  (186] 

o  o 

5.2  BURG  TECHNIQUE 

The  problem  setting  is  the  same  as  that  for  the  previous  subsection,  in¬ 
cluding  (169)  -  (170).  The  solution  is  identical  to  that  for  subsection  4.7,  up  to 
(150).  Now  we  define  zero-th  order  forward  and  backward  sequences  as 


f(°>  _  v  h(°)  _v  i  .  .  v  k  T 

i  * ■  x  j  d  —  x  ,  1  ^  n  N  j  n  n  I  • 

n  nn  n  to 


where  we  again  employ  the  definition  (173).  The  first-order  sequences  are  de¬ 
fined  as 


f<‘>  =  f<°> . .  „<»> 

n  n  1  n-1 


K(1>  _  h(0)  *  f(0) 
b  =  b  .  -  g,  f 


2  <  n  £  N  ,  n  £  , 


where  the  restriction  of  set  1^  is  due  to  the  fact  that  the  first-order  sequences 
cannot  be  formed  (evaluated)  in  set  1^.  We  choose  cross  gain  to  minimize 
the  average  error 
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F(1)  S 


2(N 


ibv  i  (lfl‘  -  K‘f )• 

t  n  =  2 


(189) 


4h 


where  N-l-B^  is  the  number  of  terms  in  the  sum.  The  solution  is  given  by 

2  £  f<01b<0)* 

n-2  1  “-1 


h '  i  n: 

•%  =5  9  ' 


+  b 


(0) 

n-1 


(190) 


n=  2 

41! 

With  this  value  of  g1(  we  can  now  compute  values  for  residuals  fb  ,  bl^  in 
(188)  and  continue  the  procedure. 

At  stage  p ,  we  have 

f(p)  =  f(p-D  _  b(p-D 
n  n  6p  n-1 


b«» ,  b(p-D  .  g.  f(p-» 


for  p  +  l<n<N,n^I 


(191) 


n-1 


P  n 


The  choice  of  cross-gain  gp  that  minimizes  average  error 


,(P)  _ 


5  n^r/l£",| 

n^  Ip 


+  b 


(P) 


(f(0)  =  l),  (192) 
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is 


N 


2  L 


f<p-D 

n 


Num(p) 
Den(p)  * 


and  the  minimum  value  of  (192)  can  be  expressed  as 


L  |2\  -PPPfe > _ 

|Bp|  )  2(N-p-B  ) 


(193) 


(194) 


This  is  also  a  relative  error,  due  to  the  normalization  (170B),  and  can  be  used 
as  an  indicator  when  to  terminate  the  recursion  procedure  in  (191). 


It  may  be  seen  from  (192)  and  (193)  that  the  sums  are  merely  taken  over 
those  values  of  n  where  the  summands  are  defined.  The  number  of  terms  in  all 
the  sums  is  N-p-Bp. 

As  in  subsection  4.7,  the  filter  coefficients  are  given  by 


v p  -1, 


(195) 


and  for  p  >  2 ,  by 


(P-D 

-k 


1  <  k  <  p 


1 . 


(196) 


Equations  (147)  through  (150)  still  hold  true.  The  starting  value  of  P(°)  is  now 
1,  by  virtue  of  normalization  (170B).  Recursion  (165)  for  i  >p  +  l  is  still  valid 
and  is  stable  since 


<  1, 


(197) 


as  may  be  seen  from  (193)  and  appendix  F.  The  spectral  estimate  is  again 
given  by  (167).  The  discussions  in  appendixes  G  and  H  are  relevant  here  also. 


55/56 

REVERSE  BLANK 


TR  5303 


6.  COMPARISONS 


All  the  techniques  considered  in  section  4  will  now  be  compared  in  terms  of 
their  resolution  capability,  bias,  and  statistical  stability,  by  means  of  a  simu¬ 
lation  approach.  In  particular,  the  fourth-order  autoregressive  process  which 
was  intensively  investigated  in  reference  2  (see  figures  4a  and  5a)  will  be  the 
basic  process  of  interest  here  also.  It  is  characterized  by 


4 

*k  =  £  Vk-n  +  V 

n=  I 


(198) 


where 


a  =  2.7607,  a  =  -  3.8106,  a  =  2.6535,  a  =  -  0.9238,  (199) 

i  ^  u  4 

and  where  {w^}  is  Gaussian  white  noise.  We  restrict  consideration  to  real 
processes  here.  We  will  not  address  the  problem  of  how  best  to  pick  the  value 
of  p  used  in  the  techniques  of  sections  4  and  5,  but  shall  instead  set  p  equal  to 
the  known  value,  4,  and  concentrate  on  the  ability  of  the  various  techniques  to 
estimate  the  parameters  in  (199),  and  thereby  the  spectrum  of  jx^j  ,  from  a 
finite  set  of  N  data  points. 

The  simulation  method  consists  of  the  generation  of  100  independent  reali¬ 
zations  of  the  process  in  (198)  in  steady  state.  The  coefficients  in  (198)  are 
estimated  for  each  of  the  100  realizations,  and  the  corresponding  100  estimated 
spectra  are  computed  by  means  of  (108),  for  every  technique  in  sections  4  and 
5.  The  examples  to  be  considered  are  summarized  in  table  2,  where  N  is  the 
number  of  data  points  in  each  realization  (trial),  and  B  is  the  number  of  bad 
points  in  each  realization.  The  corresponding  figures  are  collected  together  at 
the  end  of  this  section. 

6.1  NO  BAD  DATA  POINTS 

In  figure  4A,  the  100  different  estimated  spectra,  one  for  each  of  the  100 
independent  trials,  are  plotted  for  the  Yule- Walker  approach,  and  for  N  =  40 
data  points.  In  figure  4B,  the  (power)  average  spectrum  of  the  100  estimated 
spectra  is  plotted,  along  with  the  true  spectrum  of  process  (198)  and  (199). 

The  true  spectrum  is  scaled  so  that  its  area  is  equal  to  that  of  the  average 
spectrum.  It  will  be  seen  from  figure  4A  that  there  is  a  great  deal  of  variabil¬ 
ity  in  the  individual  spectral  estimates.  From  figure  4B,  we  observe  that  the 


57 


TR  5303 


Table  2.  Simulation  Examples 


Figure  Number  of  Number  of 

Number  Data  Points  Bad  Points 

N  B 


Yule-Walker 

Yule- Walker,  Unbiased 

Least  Squares  of  Box  and  Jenkins 

Approximate  Maximum  Likelihood 
of  Box  and  Jenkins 

Prediction,  Valid  Error  Points 

Forward  &  Backward  Prediction 

Burg 

Burg,  Uniform  Noise 

Forward  &  Backward  Prediction 

Burg 

Forward  &  Backward  Prediction 
Burg 

Forward  &  Backward  Prediction 
Burg 

Forward  &  Backward  Prediction 
Burg 

Forward  &  Backward  Prediction 
Burg 


58 


TR  5303 


average  spectrum  does  not  resolve  the  two  narrowband  peaks  of  the  true  spec¬ 
trum*;  in  fact,  this  same  conclusion  is  true  for  the  individual  spectra  in  figure 
4A.  A  severe  bias  exists  in  the  skirts  of  the  average  spectrum,  which  gives  a 
gross  overestimate  of  the  power  in  bands  away  from  the  peaks.  Thus,  the 
Yule- Walker  approach  has  poor  resolution,  severe  bias,  and  substantial  vari¬ 
ability. 

The  corresponding  results  for  the  unbiased  version  of  the  Yule-Walker  ap¬ 
proach  are  displayed  in  figure  5.  Rather  than  improving  the  situation,  it  is 
found  that  the  spectral  estimates  are  worse  in  every  regard.  The  spectral 
estimates  with  strong  spikes  near  f  =  are  manifestations  of  pole-pair  loca¬ 
tions  of  estimate  (108)  that  are  very  close  to  the  unit  circle  O.  Recall  from 
subsection  4.2  that  the  zeros  of  (112)  need  not  lie  inside  O;  see  the  discussion 
below  (118). 

The  unbiased  correlation  estimates  utilized  above  in  the  normal  equations 
are  of  the  same  form  as  those  suggested  in  reference  5  for  missing  data,  when 
spectral  estimation  is  attempted  directly  via  (2).  But  since  the  performance  of 
these  unbiased  correlation  estimates  is  so  poor  here,  they  are  not  considered 
worthwhile  in  the  presence  of  bad  data  points,  when  spectral  estimation  is  ac¬ 
complished  via  (108).  Whether  they  are  worthwhile  for  use  in  (2)  is  not  known. 

Results  for  the  least-squares  approach  of  Box  and  Jenkins  are  given  in  fig¬ 
ure  6.  The  variability  is  less  than  that  for  the  Yule-Walker  estimates  in  figure 
4A.  And  some  resolution  is  achieved  in  figure  6B,  in  addition  to  good  skirt 
selectivity.  There  is  still,  however,  a  large  number  of  spiky  spectral  esti¬ 
mates,  as  anticipated  in  the  discussion  under  (121). 

Conditions  are  not  much  improved  for  the  approximate  maximum  likelihood 
method  of  Box  and  Jenkins  presented  in  figure  7.  There  happens  to  be  one  par¬ 
ticular  spectral  estimate  with  a  very  large  spike  (a  zero  virtually  on  O)  that 
severely  influences  the  average  power  level.  The  variability  in  the  estimated 
skirt  level  is  quite  small  for  this  technique  (as  well  as  for  the  previous  one). 

In  figure  8,  the  results  for  prediction  using  only  the  valid  error  points  are 
presented.  The  resolution  and  bias  in  figure  8B  are  observed  to  be  very  good, 


♦This  same  conclusion  is  reached  in  reference  2,  figure  5b,  for  the  same 
number  of  data  points.  Increasing  p  (above  4)  does  recover  some  of  the  resolu¬ 
tion  of  the  two  narrowband  peaks,  but  it  does  not  reduce  the  severe  bias  of  the 
Yule-Walker  approach. 
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and  except  for  some  spiky  estimates  in  figure  8A,  the  variability  of  the  individ¬ 
ual  estimates  is  fairly  small. 

The  situation  is  still  better  when  we  consider  forward  and  backward  predic¬ 
tion,  using  only  the  valid  error  points,  in  figure  9.  There  are  a  couple  of  spiky 
estimates,  but  they  are  not  excessively  large,  as  they  were  previously.  The 
bias  and  resolution  are  very  good  in  figure  9B.  Although  the  zeros  of  (112)  need 
not  remain  inside  O  for  this  technique,  it  was  found  that  in  all  100  trials,  no 
zeros  were  ever  located  outside  of  O.  The  presence  of  the  two  spiky  estimates, 
however,  indicates  that  on  two  occasions, a  zero  came  close  to  the  periphery 
of  O. 

One  of  the  major  drawbacks  of  this  technique  is  the  need  to  invert  a  non- 
Toeplitz  matrix  (or  an  equivalent  operation)  in  order  to  evaluate  the  optimum 
filter  coefficients,  see  (140).  For  large  p,  this  is  a  significant  numerical  prob¬ 
lem.  We  therefore  attempted  to  convert  the  matrix  tSfm]  in  (140)  to  a  Toeplitz 
matrix,  so  that  the  recursive  solution  in  appendix  A  could  be  utilized.  We  first 
averaged  [S|mJ  ^  down  the  diagonals  and  left  the  right-hand  side  of  (140)  as  is; 
however,  we  lost  resolution  and  got  badly  biased  and  more  variable  spectral 
estimates.  Next  we  diagonally  averaged  ISjm]  P  and  left  the  right-hand  side  of 
(140)  alone,  but  got  the  same  bad  effects.  Finally,  we  diagonally  averaged 
|Sfm)  P  and  also  replaced  the  terms  on  the  right-hand  side  of  (140)  by  the  ap¬ 
propriate  averages,  but  again  to  no  avail.  Thus,  we  are  unable  to  significantly 
modify  (140)  without  dire  effects  on  the  spectral  estimate. 

Finally,  when  the  Burg  technique  is  considered  in  figure  10,  we  observe  the 
complete  absence  of  spiky  estimates,  this  is  due  mainly  to  the  guaranteed  loca¬ 
tion  of  the  zeros  of  (A-10)  inside  O.  In  other  respects,  the  results  of  figures  9 
and  10  are  very  similar.  There  is  a  small  bias  in  figure  LOB,  with  the  peaks 
being  rounded  off  and  the  valley  filled  in;  this  is  similar  to  figure  5  in  refer¬ 
ence  2. 

All  the  results  above  have  been  conducted  for  Gaussian  white  noise  jwjJ  in 
(198).  To  see  the  effect  of  the  statistics  of  jw^j  upon  the  spectral  estimates, 
we  changed  to  a  uniform  distribution.  The  results  in  figure  11  are  virtually 
identical  to  those  in  figure  10.  Accordingly,  Gaussian  statistics  are  kept  for  the 
remainder  of  the  simulation. 
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6.2  BAD  DATA  POINTS 

By  virtue  of  the  results  of  the  preceding  subsection,  further  consideration 
is  limited  to  the  forward  and  backward  prediction  technique  and  the  Burg  tech¬ 
nique.  The  first  example  we  consider  is  B  ■  4  bad  data  points  out  of  a  total  of 
N  ■  40  data  points;  that  is,  in  each  of  the  100  realizations  of  40  data  points,  4 
points  (no  more,  no  less)  were  randomly  selected  as  being  bad,  and  the  corre¬ 
sponding  values  of  were  suppressed.  In  some  of  the  realizations,  the  four 
data  points  may  have  been  close  together  (for  example,  10,  12,  14,  15),  whereas 
in  other  realizations,  they  might  have  been  far  apart  (for  example,  1,  14,  27, 
40). 


The  resulting  spectral  estimates  are  given  in  figures  12  and  13.  The  vari¬ 
ability  in  the  skirts  is  less  for  the  forward  and  backward  prediction  technique 
than  for  the  Burg  technique .  However ,  the  spiky  nature  of  the  former  technique 
is  quite  evident  in  comparison  with  the  latter  technique.  Both  techniques  have 
suffered  a  significant  loss  of  resolution  near  the  narrowband  peaks. 


The  reason  for  the  significant  degradation  in  performance  of  both  tech¬ 
niques  is  that  although  only  B/N  *  4/40  (10%)  of  the  points  are  bad,  the  number 
of  valid  error  points,  N-p-Bp  in  (174)  and  (192),  can  decrease  significantly. 
For  example,  for  p*4  and  spaced  bad  points  at  *  11,  M2  ■  16,  M3  *  21, 

M4  =  26  (see  (169) ),  we  have 


B  *  20 ,  N  ~  p  -  B  ■  16 . 
P  P 


(200) 


On  the  other  hand,  for  contiguous  bad  points  at  M^  «  1,  M2  *  2,  M3  *  3,  M4  = 
4,  we  have 


B  *4,  N-p-B  *  32  .  (201) 

P  P 

Thus,  anywhere  from  16  to  32  valid  error  points  can  be  achieved.  The  stability 
of  the  spectral  estimate  for  (200)  will  be  less  than  that  for  (201).  Generally, 
contiguous  bad  points  are  less  damaging  than  spaced  bad  points,  because  more 
valid  error  points  can  be  formed  when  the  bad  points  are  contiguous. 

One  of  the  points  of  the  above  example  is  that  4  bad  data  points  out  of  40  is 
rather  detrimental.  We  consider  now  N  ■  100  data  points.  The  first  example 
of  interest  will  serve  as  a  comparison  case  and  is  B  »  0.  The  results  of  spec¬ 
tral  estimation  for  the  forward  and  backward  prediction  technique  and  Burg 
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technique  are  given  in  figures  14  and  15,  respectively.  The  results  are  virtu¬ 
ally  identical;  there  is  excellent  resolution  and  almost  no  bias  for  both  tech¬ 
niques  . 

When  B  is  increased  to  10,  the  results  in  figures  16  and  1?  are  obtained. 
Despite  10%  bad  points,  good  performance  in  terms  of  stability,  bias,  and  reso¬ 
lution  is  attained.  The  number,  N-p-Bp,  of  valid  data  points  can  vary  from  46 
to  86;  however,  the  likelihood  of  realizing  as  few  as  46  on  a  random  basis  is 
very  remote.  The  Burg  technique  has  less-spiky  estimates  near  the  narrow- 
band  peaks,  as  expected,  however,  it  is  more  variable  in  the  skirts  than  the 
forward  and  backware  prediction  technique . 

When  B  is  increased  to  20,  the  results  in  figures  Is  and  19  indicate  that  the 
Burg  technique  has  more  variability,  but  is  less  spiky  and  has  better  resolution. 
The  same  conclusions  hold  true  for  B  »  30  in  figures  20  and  21.  however, 
neither  technique  resolves  the  two  narrowband  peaks  for  this  many  bad  data 
points. 


Figure  7.  Approximate  Maximum  Likelihood  of  Box  and  Jenkins; 


True  Spectra 


100  Spectral  Estimates  B.  Average  and  True  Spectra 
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Figure  9.  Forward  and  Backward  Prediction; 


100  Spectral  Estimates  B.  Average  and  True  Spectra 


Figure  10.  Burg;  N  =  40, 


Figure  11.  Burg,  Uniform  Noise; 


100  Spectral  Estimates  B.  Average  and  True  Spectra 


Figure  12.  Forward  and  Backward  Prediction; 


100  Spectral  Estimates  B.  Average  and  True  Spectra 


Figure  13.  Burg; 


00  Spectral  Estimates  B.  Average  and  True  Spectra 


TR  5303 


WISN30  H3M0d 


74 


FIEOUENGY 


100  Spectral  Estimates  B.  Average  and  True  Spectra 


FREQUENCY  FREQUENCY 

Figure  16.  Forward  and  Backward  Prediction;  N  =  100,  B  =  10 


100  Spectral  Estimates  i  B.  Average  and  True  8pectra 
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Figure  17.  Burg;  N  =  100, 


100  Spectral  Estimates  p.  Average  and  True  Spectra 
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Figure  18.  Forward  and  Backward  Prediction;  N  =  100,  B  =  20 


100  Spectral  Estimates  II.  Average  and  True  Spectra 


TR  5303 


WISN30  H3M0d 


79 


FREQUENCY  FREQUENCY 

Figure  20.  Forward  and  Backward  Prediction;  N  100,  B  -  3(1 
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7.  DISCUSSION  AND  CONCLUSIONS 

Several  methods  of  spectral  estimation  via  linear  predictive  techniques  have 
been  considered  for  a  univariate  process,  both  with  and  without  bad  data  points; 
the  bad  points  can  be  regularly  spaced,  randomly  spaced,  or  a  combination. 

Two  particular  methods  have  been  found  to  have  better  performance  than  the  re¬ 
mainder,  namely,  the  forward  and  backward  prediction  technique  and  the  Burg 
technique.  The  former  technique  tends  to  have  less  variability  on  the  skirts, 
but  has  more  spiky  estimates  near  the  peaks  of  the  spectrum;  the  latter  tech¬ 
nique  has  very  few  spiky  estimates.  Both  techniques  have  comparable  resolu¬ 
tion  and  bias. 

Since  the  best  choice  of  filter  order,  p,  is  not  known  a  priori,  it  is  neces¬ 
sary  in  practice  to  make  several  guesses  at  this  parameter  and  compute  some 
error  criterion  that  indicates  when  to  terminate  the  recursion.  In  particular, 
Akaike's  Information  Criterion  (reference  22)  is  often  adopted  as  a  termination 
procedure;  it  takes  the  form  (reference  1,  equations  (91)  and  (41)  or  reference 
22,  page  719) 

AIC  =  in  (Relative  Error)  +  -0—  (AIC  (p  =  0)  =  0)  ,  (202) 

A 

e 


where  Ne  is  the  "effective"  number  of  data  points,  and  is  taken  as  N-p  (or 
N-p-Bp  for  bad  points)  here,  at  the  p-th  stage.  The  value  of  p  at  which  (202) 
is  a  minimum  is  taken  as  the  best  estimate  of  this  parameter;  however,  criterion 
(202)  is  not  absolute,  and  the  user  can  adjust  it  to  fit  his  application  (reference  1, 
page  575).  A  wide  range  of  values  of  p  may  have  to  be  investigated  if  little  is 
known  about  the  true  spectrum  a  priori;  an  upper  bound  on  p  is  given  by  Akaike 
as  3N1/2  (Ibid). 


One  of  the  ramifications  of  this  successive  guessing  procedure  is  that  for 
the  forward  and  backward  prediction  technique,  a  different  pxp  matrix  [Sj^)  P 
mast  be  inverted  (or  an  equivalent  operation  conducted)  at  each  stage  (see 
(140)  and  (138) )  in  order  to  determine  the  filter  coefficients  and  minimum 
error,  (141).  Although  the  matrix  terms  can  be  updated  according  to 


X  X*  +  x  X 

s(p+l)  _  N-p  g(p)  _  p-t-l-m  p+l-n  N-p+n  N-p+m 
nm  N-p-1  nm  2(N-p-l) 


(203) 


in  addition  to  the  relations  in  (139),  the  size  of  the  matrix  [Sj^J  ^  grows  with 
p,  and  the  solution  of  (140)  can  be  a  time-consuming  procedure,  if  many  large 
values  of  p  mast  be  investigated.  This  fact,  coupled  with  the  fact  that  this 
spectral  estimation  technique  can  yield  spiky  estimates  and  an  unstable  recur- 
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sion  relation  (149),  leads  to  the  conclusion  that,  of  the  methods  considered,  the 
Burg  technique  is  the  recommended  procedure  for  spectral  analysis  of  univari¬ 
ate  processes.  A  comparison  with  the  maximum  likelihood  technique  (reference 
23)  is  underway  and  will  be  documented  in  a  future  report. 

The  solution  for  the  filter  coefficients  in  the  Burg  technique  is  accomplished 
recursively  as  shown  in  subsection  4.7  and  automatically  progresses  through 
successively  larger  values  of  p  at  which  error  measures  (150)  and  (156)  are 
readily  calculated.  There  is,  of  course,  the  need  to  update  the  forward  and 
backward  residuals  via  (153),  and  the  calculation  of  cross-gain  gp  in  (155),  both 
of  which  take  time  to  effect.  But  the  effort  required  actually  decreases  as  p 
increases,  since  fewer  terms  are  involved  in  (153)  and  (155);  in  exchange,  the 
stability  of  the  estimates  also  decreases. 

FORTRAN  programs  for  the  Burg  technique,  both  with  and  without  bad  data 
points,  are  given  in  appendix  J.  Some  representative  execution  times  on  the 
Univac  1108  for  the  computation  of  the  filter  coefficients  (SUBROUTINE  BURG) 
are  given  in  table  3,  where  N  is  the  number  of  data  points  and  PMAX  is  the 
maximum  order  of  filter  considered.  The  times  are  approximately  linearly 
proportional  to  N  and  PMAX.  The  execution  time  for  the  evaluation  of  the 
power  density  estimate  itself  is  governed  by  the  FFT  technique  employed  to 
evaluate  (167)  (SUBROUTINE  POWERS). 

Table  3.  Execution  Times;  No  Bad  Data  Points 


N 

PMAX 

. 

Time  (sec) 

100 

10 

0.038 

100 

20 

0.073 

1000 

10 

0.33 

1000 

50 

1.78 

10000 

50 

17.9 

10000 

150 

48.  4 
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The  presence  of  bad  data  points  is  easily  accommodated  in  the  Burg  tech¬ 
nique,  as  shown  in  subsection  4.7.  If  the  bad  data  points  are  contiguous,  the 
loss  in  stability  of  the  estimates  is  not  as  great  as  when  the  bad  data  points  are 
spaced.  The  worst  possible  locations  of  bad  data  points  occur  when  the  closest 
spacing  is  >p  +  1,  since  each  bad  data  point  causes  the  loss  of  .p  +  1  valid  error 
points.  Interpolation  of  spaced  bad  data  points  has  proven  poorer  than  the  tech¬ 
nique  utilized  here  (of  ignoring  bad  points)  when  the  spectral  content  of  the  input 
process  extends  fairly  close  to  the  Nyquist  frequency  (2  A)-*.  Since  the  exact 
extent  of  the  input  spectrum  is  unknown  a  priori,  interpolation  can  be  a  damag¬ 
ing  procedure  in  some  cases. 

The  spectral  estimation  technique  investigated  here  is  particularly  advan¬ 
tageous  for  short  data  segments,  where  other  methods  are  inapplicable.  For 
example,  if  a  piece  of  equipment  fails  frequently,  short  disjointed  pieces  of  data 
may  be  all  that  are  available.  Or  if  a  process  is  nonstationary,  it  may  be  nec¬ 
essary  to  cut  the  total  data  record  into  small  segments  in  each  of  which  it  is 
believed  that  conditions  are  substantially  stationary.  For  longer  data  records, 
where  standard  FFT  techniques  can  be  applied,  it  has  been  recommended  that 
both  spectral  estimation  procedures  be  applied  and  the  results  plotted  together 
to  glean  maximum  information  about  the  true  spectrum  (see  reference  12). 

This  seems  particularly  useful  when  some  pure  tones  are  present  in  the  input 
data;  the  standard  FFT  technique  is  ideally  suited  for  the  analysis  of  pure  tones 
or  very  narrowband  components. 


83/84 

REVERSE  BLANK 


TR  5303 


Appendix  A 


RECURSIVE  SOLUTION 


If  we  employ  (52)  in  (46),  there  results 


P  “k  1 

E  r  —  5,  .  £  p- 

X-k  a  c  Xo 
k=0  o  oo 


(A-l) 


Now  define 


0<ksp, 

Of  K 

O 


(A-2) 


where  the  dependence  of  the  coefficients  on  the  order  p  in  (31)  is  indicated  ex¬ 
plicitly.  Then  (A-l)  becomes 


R  R  , 
o  -1 


R,  R 
1  o 


(A-3) 


where  the  matrix  R  is  Hermitian  and  where  we  have  also  indicated  that  the  real 
quantity  cOQ  is  dependent  on  p;  see  (47)  and  (51).  Equation  (A-3)  constitutes 
p+1  linear  equations  in  the  p+1  unknowns  ajP) . a^p),  1/c^  . 

The  solution  to  (A-3)  can  be  obtained  recursively  as  follows  (see,  for  ex¬ 
ample,  reference  11  or  reference  24,  appendix  B): 

‘l1’  *  VRo  •  117  -  Ro  -  R-1  *lW  *  Ro  (l  -  K1  2) ;  <A-4» 
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for  p  >  2 : 


P-1 

R  -  £  a»-1)R  . 

P  te-1  2  p 

Ro  -  P£  ^  k 

k=  1 


=  [a  -  PE  a«-11  R  J  e*”1’  , 
l  k=l  j 


(A- 5) 


^  “  af  af-k  }  *  k  =  1*  2. 


(A- 6) 


(p)  Ro 


Z  <  =  R 


«W  R* 
P  P 


(A-7) 


The  last  step  in  (A-7)  is  obtained  by  substituting  (A-6)  and  employing  (A-5).  It 
is  very  important  to  notice  from  (A-6)  that  once  a^3)  is  specified,  all  the  p-th 
order  filter  coefficients  can  be  calculated  from  (p-l)th  order  coefficients.  The 
same  is  true  of  (A-7). 

If  we  use  (A-2)  and  (53),  the  maximum  entropy  spectrum  in  (55)  can  be  ex¬ 
pressed  as 


"nr 


£  exp(-i2ff  fkA) 

k=  0 


T”  »  If  i  <  TT- 
12 


(A-8) 


The  similarity  in  form  to  (14)  will  be  complete  when  it  is  shown  (in  (67) )  that 
l/cjg  is  the  minimum  value  of  the  average  magnitude-squared  error  for  a  p-th 
order  predictive  filter;  therefore  must  be  positive  for  all  p,  for  non-negative 
definite  R.  Equation  (A-7)  offers  a  recursive  calculation  of  the  average  error;  it 

can  be  started  with  "  (p)  =  RQ  .  (In  fact,  (A-5)  through  (A-7)  can  be  used  for  p  >  1 
coo 

when  that  starting  value  is  used. ) 
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Since  must  be  positive  for  all  p,  (A-7)  indicates  that 

a^  <  1  for  k  =  1,  2,  ....  p.  (A-9) 

This  is  equivalent  to  having  all  the  zeros  of 

V"  a(P)  2"^  (A-10) 

2.  ak  2  • 

k=  0 

(where  the  remaining  coefficients  are  determined  via  (A-6) )  inside  the  unit  cir¬ 
cle,  O,  in  the  complex  z-plane;  see  reference  1,  page  567.  Therefore 


BW»  s  £  zk 

k=  0 


has  no  zeros  inside  O. 


(A-ll) 
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But 


sir  /-f  [*n  aP *  I>  <z  -  v] 
fin  “p  *  £  57/4 in  <2  -  vl 


(B-6) 


in  (2  -  0^)  =  in  (-  0^)  +  in  (1  -  ~) 

k 


=  „n  o  >  -  - 

k 


(B-7) 


that  is,  expansion  (B-7)  converges  for  ! z I  <  lo^l  .  But  since  oki  >L,  the  region 
of  integration  in  (B-6)  remains  in  the  convergence  region  of  (B-7).  Therefore, 
the  integral  in  (B-6)  is 


Then  from  (B-6)  and  (B-4) 


:1  * 


£  *  -  in  a  +  £  in  (-  o  ) 


k=  1 


=  -  in 


hi,'--’] 


in  B(0)  =  -  in  a  . 

o 


(B-9) 


And  from  (B-l)  and  (B-4) 


£  =  -A  I  df  in  >*(f)  =  -  ^  j  df  in |  V  a*  &\p(i2 t fk A)l 

2  \n  Vi  K.o  k  S 
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*k=  0  f 

■  -  [**  +  £  i“(i  -  0] 

-  -  [*» «; +  £  *<*  -  \)]  •  <»-«> 

Now 


°0 + in  ^ 

2 

=  in  (-  o*)  -  — j-  -  / \  -  ...  for  <  1 ; 

l  k ;  o  z  ^o*z/  o*z 


(B-ll) 


that  is,  expansion  (B-ll)  converges  for  Izl  >-=*-.  But  since  loj^i  >1,  the 

lo^  | 

region  of  integration  in  (B-10)  remains  in  the  convergence  region  of  (B-ll). 
Therefore,  the  integral  in  (B-10)  is 

ISt/t *>(r-  °k)*  bt/t )in f°k)-  -  ••!  =J!"(-0k) 


(B-12) 


Then  from  (B-10)  and  (B-4) 


f2  =  -  jin  .*  *  in  (-  o')j  =■  -  inL*  ^  (-  o*)j 


=  -  in  B*(0)  =  -  in  a*  . 


(B-13) 
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Combining  (B-9)  and  (B-13)  in  (B-l),  there  follows  for  the  maximum  entropy 

Ent  =  -  SLn  |aQ|2  =  j2n  (a/cqo)  »  (B-14) 

where  we  have  also  employed  (52).  (For  p  =  0,  a  separate  derivation  yields 
(B-14)  also.)  Recall  from  (51)  that  Cqq  is  the  upper-left  corner  element  of  R'1, 
where  R  is  defined  by  (47) . 
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Appendix  C 

IMPLICATIONS  OF  ASSUMPTION  OF  WHITE  SPECTRUM 
FOR  MINIMUM  ERROR;  KNOWN  CORRELATION 


We  define  the  crosscorrelation  function  between  minimum  error  T  and  input 
x  in  figure  1  as 


C,  =  7k  x*_( .  alii. 


(C-l) 


Substituting  (59)  and  utilizing  (1),  this  becomes 


C/  m  £  \  Rl-n  ’  *“' 
n=  0 


(C-2) 


Now  from  (64)  and  (65),  we  can  express 


R  •= - »  . 

c 

oo 


(C-3) 


Thus,  (C-2)  immediately  yields 


Cf  = 


-l/cno,i  =  0 

OO 


o,  1  <  SL  <  p 


(C-4) 


that  is,  minimum  error  value  7^  is  uncorrelated  with  the  past  p  inputs 
•  •  •  *  xk-p • 

Now  using  (59)  and  (C-I),  the^autocorrelation  function  of  the  minimum 
error  is 


E«  5  \  V,  *  L  ■»:  \  Xk-,-n  *  £  K  <W ’  a“  *  ■  «C-5» 

n= 0  n=  0 
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In  particular,  using  (C-4), 


£ 


1 


But  from  (C-2)  and  (66), 


t  \  Vl-= 

n=  0 


P  c 


n  =  1  oo 


R 


p^l-n  ' 


(C-6) 


(C-7) 


Therefore,  assuming  E^  =  0  is  equivalent  to  assuming  =  0,  (that  is,  mini¬ 
mum  error  uncorrelated  with  input  which  in  turn  is  equivalent  to 

requiring 


P  c 


n=  1  oo 


R 


p+l-n 


P 


£ 

n=  1 


a  R 
n  p 


+l-n  ' 


(C-8) 


This  relation,  which  may  not  be  true  for  the  actual  process  jx^l  ,  is  a  direct 
result  of  assumption  (70);  the  quantity  Rp+^  in  (C-8)  is  really  an  approximation 
to  the  true  (unknown)  correlation  value. 


Next  from  (C-5), 


E 


2 


f  a 


P 


(C-9) 


Assuming  E2  *  0  (in  addition  to  E^  =  0)  is  equivalent  to  also  assuming  Cp_2  =  0, 
which  in  turn  from  (C-2)  and  (66)  requires  that  we  approximate  according  to 

P  c 

R  =  -  V  -22.  r 
p+2  c  p-2-n 

n  s  1  00 

Continuing  in  this  way,  it  follows  that  assuming  white  noise  for  J  7^} ,  that 
is,  assuming 


-  £ 


a  R 
n  p-2- 


(C-I0) 


Ef  -  0  for  i  >  L  ,  (C-lli 

is  equivalent  to  assuming  that  Cf  =  0  for  i  >  p  -  1 ;  that  is,  the  minimum  error 
is  uncorrelated  with  all  past  inputs.  There  follows  the  approximations 


C-2 
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Pc  P 

r  =  -  T  —  R,  =  V  a  R.  for  i  >p  +  1.  (C-12) 

n  =  1  00  n=  1 


This  recursion  relation  (starting  with  known  values  R^,  R2,  ....  Rp)  can  be 
considered  to  be  an  extrapolation  of  the  known  correlation  values  into  regions 
where  they  are  unknown. 


If  we  augment  (C-12)  according  to 


R  R*  for  j2  >  p+1  . 
then  it  can  be  shown  that  the  spectrum  defined  by 


(C- 13) 


ao  A/C 

AE  R(  exp(-i2rf£A)  - - — - 2  .  (C-l4) 

/  =-  «o  P 

1  -  a  exp(-i2irfnA) 

i  n 

n=l 

which  is  identical  to  (71).  The  transform  in  (C-14)  converges  if  |Rf|  decays 
with  increasing  [i  |  .  that  is,  if  B(z)  of  (56)  has  no  zeros  inside  O  . 
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Appendix  D 

STABILITY  OF  RECURSION  RELATION 


The  recursion  relation  for  approximated  correlation  values  Rf  is  given  in 
(C -12)  and  (75)  as 


R |  =  £  a!  R^  for  i  >  p  +  1 
k=  1 


(D-l) 


Therefore, 


CD 


_ |  r  u  ci>  _  iLir 

u(z)  =  £  R/  z  -  E  \z  E  Ri_kz  •  (°-2> 

f=p  + 1  k=  I  /  =  p  + 1 


But 


®  -/+k  ® 


E  R/-. 


■3  _ 


-k 2  =  £  R  Z  '  = 

/=»p  +  1  j  =  p  +  l  -k  j  =  p  +  l-k 


E  Rj 


r“J 


+  E  R;  Z~J  =  Vk(Z>  +  U(Z)  • 

j  =  p  +  l 


(D-3) 


where 


Vk(I)  *  Rp+1-k  r<P+1‘k’  *  Vk_l(z)  ,  k  >2  ;  Vl(k)  =  Rp  rP  .  (D-4) 

Vk(z)  involves  the  starting  values  Rp+i_k.  ....  Rp  for  1  <  k<  p.  Employment 
of  (D-3)  in  (D-2)  yields 


P  P 

U(z)  =  £  3^  z"k  V  (z)  +  U(z)  £  \  z~k' 
k= 1  k=  1 


(D-5) 
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where  we  have  utilized  (66).  In  order  that  recursion  (D-l)  be  stable,  the  de¬ 
nominator  of  (D-6)  must  possess  all  its  zeros  within  the  unit  circle  O  in  the 
complex  z-plane.  Therefore,  B(z)  of  (56)  must  possess  all  its  zeros  outside 
O  if  recursion  (D-l)  is  to  be  stable.  This  is  guaranteed  by  the  results  in 
(A -3)  et  seq. 


D-2 
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Appendix  E 

IMPLICATIONS  OF  ASSUMPTION  OF  WHITE  SPECTRUM; 
UNKNOWN  CORRELATION 


The  minimum  error  sequence  is  given  by  (96)  and  (101)  as 


?  =  T  a  x,  ,  all  k. 
k  n  k-n 

n=  0 

The  sample  autocorrelation  of  j is  defined  here  as 


(E-l) 


r  r.  7*  =  Y  H  a*  S4 

— '  k  k-f  *—•  m  n  /+n-m 

k  m,n  =  0 


(E— 2) 


using  (E-l)  and  (98).  The  sample  spectrum  of  is  defined  here  as 

Hr(f)=4  f;  Ff  exp(-i2*fiA)  =  Hx(f)|A(f)|2,  If  I  <  ± ,  (E-3) 

1=  -CD 

where  we  have  employed  (E-2)  and  (107)  and  defined  the  sample  spectrum  of 

lxki  as 


H  (f)  =  A  £  Sf  exp(-i2rrfiA),  If  I  <  — . 


(E-4) 


l  =  -CD 


Therefore,  (E-3)  yields 


H7(f)  1 

H  (f)  =  - rj  ,  If  1  <TT. 

X  I  A(f)|  2  2A 


(E-5) 


Now  we  will  assume  that  the  sample  spectrum  of  {7^1  is  white;  that  is,  we 


E-l 
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H7(f)  =  KA,  If  I  <  — , 


(E-6) 


where  K  is  a  constant.  We  then  adopt  an  estimate  of  the  sample  spectrum  of 
sequence  jx^  ^  according  to 


H  (f)  = 
x 


H7(f) 
A(f)  |'2 


KA 


V  IT  exp  (-i2nfn  A) 

r,  n 

n=  0 


(E-7) 


and  adopt  a  scaled  version  of  this  quantity  as  a  spectral  estimate  of  process 
I  xn|  • 


G„(f)  = 


£  a  exp(-i2rrfnA) 
n=  0  n 


?  Ifl<2l' 


(E-8) 


The  white  assumption  in  (E-6)  forces  us  to  assume  that 

F 't  =  0  for  j2  *  0 , 


(E-9) 


as  (E-3)  shows.  In  order  to  see  what  this  implies,  we  utilize  the  definition  of 
the  sample  crosscorrelation  in  (109),  along  with  (96)  and  (98),  to  obtain 


=  i  E  =  I  a  S.  ,  all  J2  . 

N  k  k-f  n  l-n 


(E-10) 


n=  0 


,-tJse  of  (101)  then  shows  that 


Dj  =  0  for  j.  <  J2  <  p  .  (E-ll) 

Meanwhile,  tttfe  sample  autocorrelation  in  (E-2)  can  be  written  in  the  form 

P 

F,  -  £  ^  o  .  an  i,  (E-I2) 

n=  0 
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upon  employment  of  (E-10).  There  immediately  follows  from  (E-9),  (E-ll), 
and  (E-12) 


F 


1 


But  then  (E-9)  and  (E-10)  indicate  that 


(E-13) 


P 

S  =  Y*  a  S  ,  ,  (E-14) 

p+1  ^  n  p+l-n  v 

n=  1 

where  |  a^}  2  11:6  t^e  solutions  of  (102).  But  relation  (E-14)  may 'not  be  true 
for  the  quantity  Sp+i  actually  obtained  from  data  |xn|  ^  via  (98).  Thus,  as¬ 
sumption  Fi  =  0  is  forcing  us  to  assume  that  Sp+2  can  be  obtained  via  (E-14) 
and  (102),  when  jsf  are  obtained  from  (98). 

Next  from  (E-12)  and  (E-ll), 


F 


2 


+  TC*  D 
P 


p+2  ‘ 


(E-15) 


Assuming  F2  =  0  (in  addition  to  F^  =  0)  is  equivalent  to  also  assuming  Dp+2  =  0, 
which  in  turn  from  (E-10)  requires  that 

P 

S  =  £  a  S  „  . 

p+2  ^  n  p+2-n 

n=  1 

Continuing  in  this  way,  it  follows  that  assuming 

F,  =  0  for  i  >1 

is  equivalent  to  assuming  Df  =  0  for  i  +  1;  that  is,  the  minimum-error  se¬ 
quence  is  uncorrelated  (on  a  single  member  function  basis)  with  all  past  inputs. 
There  follows  the  estimates 


(E-16) 


(E-17) 


P 

Sf  =  £  aQ  S  ,  i>p  +  1.  (E-18) 

n=  1 

Stability  is  discussed  in  (111)  et  seq. 
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Appendix  F 

BOUND  ON  CROSS-GAIN 


The  value  of  the  cross-gain  gp  in  (155)  can  be  written  as 


£  f(p-i) 

,  n  n-1 
n  =  p  +  l 


z  I 

n  =  p  +  l 


£ 

n=p  +1 


3<P-1) 

n-1 


(jJgLlJgfT 

£  (l>rf*toT) 

n=p+l '  '(F- 


The  first  factor  in  (F-l)  is  of  the  form  of  a  correlation  coefficient  of  the  (p-1)- 
th  order  forward  and  backward  sequences  and  can  never  exceed  unity  in  magni¬ 
tude  (by  Schwarz's  inequality).  The  second  factor  in  (F-l)  is  almost  always 
very  close  to  1:  let  the  pair  of  sums 


{  £  I'M2  ^  £  I-&T}  •  { 

vn=p+l  n=p +1  /  V 


A  and  A(l+r) 


(F-2) 


where  r  >0  without  loss  of  generality.  The  second  factor  in  (F-l)  then  equals 
(1+r)1'2 

-~+^y2  - ,  which  is  never  larger  than  1  and  is  tabulated  below.  Thus,  gp  in 

(F-l)  is  virtually  identical  to  the  correlation  coefficient  of  the  forward  and  back¬ 
ward  sequences,  since  r  is  near  zero  with  high  probability. 


Table  F.l  Second  Factor  in  (F-l). 


r 

0 

.1 

.2 

.3 

.4 

.5 

(l+r)1/2 

l+r/2 

1 

.999 

.996 

.991 

.986 

.980 

F-l/F-2 
REVERSE  BLANK 
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Appendix  G 

CLOSENESS  OF  ERROR  MEASURES 


Two  possible  error  measures  for  the  Burg  technique  were  presented  in 
(150)  and  (156).  For  p  =  0,  employing  (154)  and  (152), 


(0)  _  1  N  2 
F  "  N  E  M 


(G-l) 


Comparing  this  result  with  (151),  we  find 


J.(°)  .  p(0)  . 

Thus,  the  two  error  measures  are  identical  for  p  =  0. 
Next  from  (150)  and  (151) 


(G-2) 


(G-3) 


whereas  from  (156),  (160),  (155),  and  (152), 


(G-4) 


G-l 
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But  now  reference  to  (151)  and  (G-3)  reveals  that,  for  N-l  large, 

.2 


F^1}  a  (l.  -  a^  ^  P(0)  =  P 


(1) 


Continuing  with  (150), 


P<2>  =  P(1»  (l  -  a(2)l 


And  (156),  (160),  and  (155)  combine  to  yield 

(2)  2\  Den (2)  _  /  I  (2)|2\  1  V 

2  /  2(N-2)  \  I  2  |  )  2(N-2)  ^ 


f(2)  =  |1  - 

O 


N 

L 

n  =  3 


But  from  (154), 


•  d)  . 


2(N-1) 


N 

I 

n  =  2 


,(l> 

n-l 


Comparing  (G-7)  and  (G-8),  we  see  that,  for  N-2  large, 

o  \  12  1/0 


Then  employing  (G-5)  and  (G-6),  we  have 


F®=(..  42>2'1p11'  =  P(2) 
o  V  2 


(G-5) 


(G-6) 


f<l>,  * 
n 


2  ‘b(1) 


n-l 


(G-7) 


(G-8) 


(G-9) 


(G-10) 


which  is  the  desired  relationship.  In  general,  for  no  bad  data  points,  we  have 

F(P)  »  p(P)  for  N_p  japgg  <  (G-ll) 


Numerical  computations  have  borne  this  result  out,  with  the  two  quantities  not 
having  any  ordered  relationship;  that  is,  either  quantity  can  be  larger  (or 
smaller)  at  different  stages,  p.  (G-9)  generalizes  to 


for  N - p  large  . 


(G-12) 


G-2 
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Appendix  H 

SCALE  FACTORS  IN  SPECTRAL  ESTIMATES 


Instead  of  using  a  unity  value  for  the  average  minimum  error  or  residual 
power  in  the  numberator  of  (167),  we  could  use  the  value  given  by  (156).  Then 
our  spectral  estimate  would  be 


A  F 


(P) 


Gx®  = 


1  -  £  a^  exp(-i2*fkA) 
k=  1 


2’ 


(H-l) 


An  alternative  approach  is  to  use  an  arbitrary  scale  factor  K  and  choose  it 
so  that  the  area  under  the  spectral  estimate  is  equal  to  the  sample  power  (151), 
as  suggested  under  (108);  that  is,  set 


G  (f) 
x' 


1 


_ AK _ 

P 

£  aj  exp(-i2»rfkA) 
k=  1 


Ifl  < 


_1_ 
2A  * 


and  force 


(H-2' 


(H-3) 


Substituting  (H-2)  in  (H-3),  and  using  (159),  we  have 


1 

2A 


P 


df 


AK 

K 

P 

2  p 

2. 

1  -  £  a£^  exp(-i2nfkA) 

n  1 

*r 

k=  1 

m  =  1 

i 

(H— 4) 


H-l 


The  last  step  in  (H-4)  is  proven  as  follows:  from  (A-8)  and  (29),  we  know  that 


2A 


P 

T.  K 

k  =  1 


(P) 


exp(-i2«fkA) 


R  cW 
o  oo 


But  from  (A -7), 


R 

o 


R 

o 


c(p-d 

oo 


p 


n 

m  =  1 


(H-5) 


(H-6) 


where  we  have  employed  R0  cq*J  =  1.  The  relationship  in  (H-4)  holds  when  the 
filter  coefficients  are  determined  via  (148). 


Therefore,  (H-4)  yields,  with  the  aid  of  (150). 


K  =  p(0'  n  ji-ICf!  ‘  P<PI- 

m  =  1 


and  the  estimate  (H-2)  becomes 


cx(f,  . 


.ip 


(P) 


r 

£  \  exP  ( - 1—  -  fk.  A  i 
k=  1 


(H-7) 


(H-8) 


The  very  close  similarity  of  values  between  the  alternatives  (H-l)  and  (H-8)  is 
made  evident  by  the  results  of  appendix  G,  in  particular  (G-ll).  Thus,  there  is 
virtually  no  difference  between  estimates  (H-l)  and  (H-8),  for  no  bad  data 
points. 
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Appendix  I 


BIASEDNESS  OF  BURG'S  CORRELATION  ESTIMATE 


For  the  Burg  technique  with  p  =  1,  N  =  3,  we  find  from  (162)  and  (144)  that 
(for  real  data) 


R  ml 
R1  3 


X2(Xl~X3)(XrX2*X3) 

2  2  2 
x.  *  2x„  +  x„ 
12  3 


The  mean  of  this  random  variable  depends  on  more  than  just  xjjx^  (=  x3x2  ) ;  in 
fact,  it  depends  on  the  third-order  joint  density  of  (x^,  x2,  x3).  As  an  exam¬ 
ple,  let 


X1  =  u  ,  x2  =  1  ^  (u  +  v) ,  x3  =  v  ,  (I 

where  u  and  v  are  independent,  zero-mean,  unit -variance,  Gaussian  random 
variables.  Then  x^,  X2»  x3  are  zero-mean,  unit-variance,  Gaussian  random 
variables  with 

x2  Xl  =  X3  X2  =  -  “I’  *3*1  =  °  '  (I 

Employing  (1-2)  in  (1-1),  we  obtain 

2  2  2 

£  1  1  (u  +  v|  (3u  +  2uv  f  3v  )  /T 

R1  “  -  6  2  2’ 

V*  u  +  uv  ♦  v 


Therefore, 


_  /•/•  /  2  2\  22 

£  11  111.,  (  u  +v  \(u  +v)  (3u  +  2uv  + 

“I'-Jlsrll  du  dv  exp(  5  / - 5 - - 

v  u  «•  uv  +  v 


.2  2 


1-1 
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■  =j  s  i  /*  >■  — 

0  -it 

where  we  have  changed  to  polar  coordinates  and  let  C  =  cos  9,  S  s  sin  0 

integral  on  r  in  (1-5)  is  2,  and  the  integral  on  9  is  W2  -  i\  . 

V  v/3/ 


Therefore. 


r  =  +  i  a  ^  i 

1-^2  9  ~JZK% 


y/SJ 

9484)  . 


which  is  not  equal  to 


X2  XL 


X  x„  =  +  - 

3  2  -^2 


,  d-5) 

.  The 

(1-6) 

(1-7) 


1-2 
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Appendix  J 

FORTRAN  PROGRAMS 


The  programs  in  this  appendix  are  written  for  reaJ  data,  but  may  be  readily 
generalized  to  complex  data  by  means  of  the  general  equations  in  the  main  text. 
From  (H-8)  and  (H-7),  for  real  data,  the  spectral  estimate  is  given  by 


Gx«t,  - 


i  n  ji  -  4k,2(  p'01 


1  -  a^>l  e*-P(-i2  »fkA) 
k=  1 


2  * 


f  < 


2a  ‘ 


(J-l) 


Let  frequency  increment 


*f  ^ 


1 

JA 


J _ 1_ 

J/2  2  A 


J/2  ’ 


(J-2) 


where  fN  is  the  Nyquist  frequency,  and  J  is  an  integer.  Then,  using  (H-3)  and 
the  real  behavior  of  the  data, 


1 

2  A 


P(0)  =  2 


f  df  Gx(f)  =  2AP(0)  /7  jl  -  a£°‘ 

*  n 


k  =  1 


J/2 

E  v 

m  =  0 


m 


P  to)  |2 

1  -  aj  exp(-i2 * mA-kA) 

k=  1  K  1 


-  V0’  n 

J  k-i 


1  -  a 


(k) 


2,  J/2 


m 


m  =  0 


1  -  a^  exp(-i2  *mk/J) 

k=  1  K 


J/2 

.«  5  £  «  pm  • 

2  mm 

m  =  0 

(J-3) 


J-l 
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where  |<m|  is  a  set  of  integration  weights  (for  example,  trapezoidal).  So  we 
can  compute  (independent  of  time  increment  the  quantity 


k=  1 


-  £  exp(-i2^ 


mk/J) 


for  0  <  m  <  — 


which  represents  the  fractional  power  in  the  frequency  band 


that  is, 


J/2 


E 

m  =  0 


m 


1 


(J-5) 


(J-6) 


if  estimate  Gx(f)  in  (J-l)  has  been  sampled  finely  enough  (that  is,  large  ■]>.  The 
denominator  of  (J-4)  is  recognized  as  a  J-pomt  FFT  of  p-1  nonzero  numbers, 
hence,  J  should  be  chosen  as  a  power  of  2  for  speed  purposes.  The  programs 
below  yield  the  fraction  of  power  in  frequency  bands  of  width  if  J  is  an 

integer  large  enough  that  the  spectral  estimate  (167)  or  (H-6)  is  adequately  sam¬ 
pled  to  keep  track  of  its  peaks. 


NO  BAD  DATA  POINTS  (SUBSECTION  4.  7) 


The  data  generation  is  accomplished  via  function  IRAND,  which  generates 
integers  uniformly  distributed  over  (0,  235-  i>;  by  RAND,  which  generates  num¬ 
bers  uniformly  distributed  over  (0,1);  and  by  TLNORM,  which  generates  zero- 
mean  unit-variance  Gaussian  variables.  The  FFT  used  below  is  that  presented 
in  reference  25. 


J-2 
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L  SPECTRAL  ESTIPATION  USE*:  CFiANSE  LINE  13  ANO  REPLACE  LINES  17-31 
C  N  S  liOrtbEH  OP  UATA  POINTS 
C  XU>....»X(N)  =  INPLT  DATA 
L  PKAX  =  MAXIMUM  ORDER  OF  F1J.TER 
L  PBtST  =  BEST  ORDER  OF  FILTlR 

C  A ( 1 )»,,,» A ( rbEsT )  =  PREDICTIVE  FILTER  COEFFICIENTS 
C  PKOC  =  PRODUCT <1-A(P)**2)  FOR  Psi  TO  PBEST 
L  RPu(1),...,kHO|PmAX)  =  NORMALIZED  CORRELATION  COEFFICIENTS 
L  u  =  bizt  OF  FFT  (MUST  BE  A  POaER  OP  2) 

c  XX  1 1 ) .... . Xa (u/i*l )  =  FRACTIONAL  POWERS*  FROM  DC  TO  NYOOIST  FREOuEnCY 

c  ccuj#,..tcj<u/M*i)  =  guarter  cosine  table 
c  r  anu  yy  ARt  required  auxiliary  arrays 

PANaMETEk  ns  100*  PM.AXSlO*  JS2048,  U41SJ/4* 1 
Ii.TE&EK  PbE$T 

CIMENSIOl,  X(N) »Y(N) iA(PmAX) ,RHO(PMAX) . XX(J) *YY(J) ,C0tM4l) 

L  INPUT  LATA  in  . . . . . . 

CtFll.E  IRAND  =  I*5**15-f(  <1“SIGM1*  I*5**i5)  >/2>  04359738367 
OlFINE  RaNDsFlOAT ( 1 ) /34359738 367 • 

1=5261 

I.STakT=N«4C0  *  •  ILL  DISCARD  INITIAL  400  POINTS 

Xxtl)=0, 

xx  12 ) =0 , 

XX13)=0. 

XA14)=0. 

Cu  11  LS5.NSTART 
IslRANu 

XX  <l>  sx,7607*XX (L-l )-3.o106»XX(L“2j *2,653b«Xa (l-3 ) - 
AO  * 9236* Xx IL-4 ) *T INuRM ( raND# ill ) 

11  CwxTlNUE 

uu  12  1  =  1#  it 

12  xiI)=XA( I+NSTART-N) 

PH1NT  1. 

1  F orK  AT l  /  •  INPuT  uATa:*) 

PrInT  a,  (X ( I  )  « 1=1«N) 

u  EVALUATE  PKLulCTWE  F*lTEm  COEFFICIENTS 

Call  dUH„(N.PMAX*X. Y.PoEST ,A*PROO*RHOj 
FrIF.T  •»,  X  ( N  ) 

9  FurMATC/'  MEAN  S'.LlA.B) 

PRINT  10*  Y<t.) 

10  FurlATC  STANQARL  uEVIAT ION  =**£13.6) 

PrInT  2 •  Pu£ST 

c  fumm at ( / •  Petsr  =  * » 1 3 1 

PrInT  3, 

F  UrvMAT  t  /  *  PREulC  T  I  v  E.  F1lTEP  COEFFICIENTS  :  '  ) 

PR1I.T  A,  ( *•  (  i  )  *  1  =  1  *  PB£ST  > 
a  F ^n« AT ( 5t20 , B ) 

PrInT  6.  PRCC 

3  F lrMAT ( / *  PRODUCT 1 1-A (P ) **2  )  =**£13, 8) 

print  o * 

c  F  urrAT ( / *  i.ORMAlIXLL  CORRELATION  COEFFICIENTS:') 

Print  4,  (RhO( 1 ) • i=l,PMAX) 

Call  OTRlOS ( Co* J ) 
c  Evaluate  FRaCTiOn«l  POWERS 

Call  PG«£RS { PbtST . A , prou *  w *  XX ,  Y Y  *  CC  > 

Print  7, 

I  FORMAT l/*  FRACTIONAL  PGaLHS:') 

L=u/2«1 

PrInT  6,  ( xx ( 1 ) 1 1=1 iL) 

o  F jrrAT (2x* 1&E1J.6) 

E*»w 


J-3 
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SUBROUTINE  bUR6(N,PMAX,x»r,PBEST»A«PROO,RHO)  £  2  Feb  1976 

C  THIS  SUBROUTINE  COMPUTES  ThE  PREDICTIVE  FILTER  COEFFICIENTS 
C  Nr  NUMBER  of  DATA  POINTS!  INTESER  INPUT 
c  PMAX  s  maximum  ORDER  OF  FILTER!  INTEGER  INPUT 
c  X(l) ,X(2),.,.,X(NJ  =  DATA  ARRAY  ON  INPUT*  ALTERED  ON  OUTPUT 
C  ON  OUTPUT t  Xli) *X(2)»...*X(PMAX>  =  AUIPMAX) , A (2 1 PMAX) . . . . . A iPMAx IPMAx) 
C  mi  ,Y(2)  •  ,..*Y(h>  s  AUXILIARY  ARRAY!  SCRATCH  INPUT 
L  ON  OUTPUT  *  yU)*Yl2),...,Y(PMAX)  -  *  ( 1 1 1  >  ,  A  U 1 2 1  , .  ,  . .  A  (PMAX  IPMAX ) 

C  ON  OUTPUT.  X (N )  =  MEAN,  A NO  Y (N)  s  STANOARD  DEVIATION  OF  InPuT  DaTa 
C  PBEST  s  BEST  ORDER  OF  FILTER!  INTEGER  OUTPUT 

C  All),A(2),...»A (PBEST )  s  PREDICTIVE  FILTER  COEFFICIENT  ARRaY  s 
C  A ( 1 iPoEST ) « A(2iPbEST ) « , . . , a (PBEST I PBEST) i  OUTPUT 
C  PROD  s  PR0CUCT(1-A(P!P6EST)**2)  FOR  P?1  TO  PBESTl  OUTPUT 

C  HhOll) . . (PMAX )  =  NORMALIZED  CORRELATION  COEFFICIENTS!  cuTPuT 

C  DIMENSION  X ih) , Y (N ) #  A (PM AX ) »RHO (PMAX )  IS  REQUIRED  IN  MAIN  PROGRAM 
INTEGER  PMAX, PBEST, P 

double  precision  s*,sb 

DIMENSION  X(I) ,Y(1) ,A(I) .RhO(l) 

IF IPMAX. GT.J.mSGRT IN) )  PRINT  2,  Pmax.N 

2  format </•  pmax  =*»i4,»  is  too  large  for  nombe*  of  data  points  =  • 

S,  IS) 

C  COMPUTE  mean 

S1=0. 

DO  1  151, N 
1  SisSI«X(i) 

SlsSl/N 

C  SUBTRACT  mean,  and  scale  To  unit  variance 
S2S0. 

CO  J  1=1, N 
Xll)SX<I)-Sl 

3  Sc=S2+X ( I ) «*2 
S2=SGRT(S2/(N-I. ) j 
T=l./S2 

DO  5  151,  N 
A(I)SXII)*T 
9  Y(1)5X<I) 

C  BEGi.i  RECURSION 
p  =  0 

PhUUUCSl, 

AICMIN=0. 

PbfeSTsO 

MMOOSl. 

D  PSM^l 

t  CAuCwuATE  CkOSS-GAIM  EQ.  lSS 
Sasc.DO 
SosO.UO 
C=M*i 

Du  7  I=L,N 
SA=SA*A ( A ) *Y I I“1 ) 

7  S«jssa*A(l)»«2*T(I-i)**2 

U52.*SA/Sfa 

PkuOUC=PmOUuC* ( 1 . -U*G ) 

u  CAuCoUATE  FILTER  COEFF  ICIlnTS  »  EQS.  IpOAImB.  store  In  XU) . XtP) 

a  iM)=G 

1»- (P.EO.l)  GO  TO  b 
t.iP/2 

Du  V  1=1, u 
Tsa ( 1 1 “G»  X <P“l ) 

A  vP-IISXJP-II-G.XU) 

V  M!):T 

U  CalCuua T t  NURMALUtO  CORRELATION  COEFFICIENT!  Eu.  149 
«  Tsa (P) 

iFlP.Lu.l)  GO  TO  14 
LiP-1 

uu  19  1  =  1, L 

I«  TsT*AfI)*RhO(P-l ) 

14  HMU(P)=T 


J-4 
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L  CALCULATE  AkAIkE ' S  INFORMATION  CRITERION!  EOS.  166A202 
RfcLEKHs  ( 1 .  -<,*6 )  »SNuL  (Sb)/(2.*(  N-P )  i 
AICrLOG { R£L£KR) *2 . *FLCAT ( H ) / <N-P ) 
lr UIC.GE.AICMiN)  60  TO  10 
AICmINsaiC 
rntSTsP 
PhOU=PRCOUC 
Ow  11  lsl*P 
*1  AU):xii) 

10  IF  IP.Em.PmAX)  uO  TO  16 

C  oPDATl  F0R«aRO  AhO  6ACKNARW  SEQUENCES  I  E6.  153 
LsP*! 

oO  12  I  TIN  »  L ,  ”1 
T=A(1)-G,Y(I-1) 

Y(I)SYII-1)-5*X<I) 

12  X(I)ST 

Y  IP) 56 
60  TO  6 

16  Y(PmAX)sG 

IF(PBEST,t«.PMAX)  60  TO  A 
C  COmPuTE  EXTRAPOLATED  NORMALIZED  CORRELATION 
C  COEFFICIENTS  from  PBEST*1  to  PMAXi  E«»  165 
LsPBEST+l 
DO  17  PsL’PMAX 
AiPjsO. 

TsO. 

DO  10  I=l»PBEST 

16  TsUA(1)«BHO(P>I) 

17  RhO(P)sT 

4  X(N)SS1 

Y(N)SS2 

RETURN 

End 


Subroutine  powersipbesi ,a,proo*u»xx,y1t,co) 
c  this  subroutine  computes  the  fractional  powers  in  bands  1/ 1 JaCELTA ) I  EO.  U-4 

C  PBEST  s  BEST  order  OF  FILTER*  INTEGER  INPUt 
C  All),....A(PBEST)  s  FILTER  COEFFICIENT  ARRAY!  INPUT 
C  PROD  s  PRODUCT (1-A(P)**2)  FOR  Psl  TO  PBEST!  INPUT 
C  J  s  SIZE  OF  FFT  (U/2+IsNUmBER  OF  FREQUENCY  POINTS)!  INTE6ER  INPuT 
C  XX  s  AUXILIARY  array  on  input 

C  XX ( 1 ) , . , . , XX  lu/2+1 )  s  FRACTIONAL  POWERS  ON  OUTPUT 
C  YY  S  AUXILIARY  ARRAY!  SCRATCH  INPUT 

c  co(1)#...,co«u/aa1)  =  ouarter  cosine  table  for  ffti  input 

C  DIMENSION  XX(U)«YY(u),C0(U/441)  IS  REQUIRED  IN  MAIN  PROGRAM 
C  DIMENSION  A(PMAX)  IS  REQUIKEO  IN  MAIN  PR06RAM ,  WHERE  PMAX.6E.PBEST 

integer  pbest 

DIMENSION  All) .XXII) ,YY<1) .C0(1) 

FsPH0D«2«/J 

xxil)*l. 

YY (1)20. 

DO  1  1=1, PBEST 
XX(I*1)S*A(I) 

1  YY(I4l)S0. 

LSPBEST42 
Du  2  Isl.J 

xxil)=0. 

2  YY ( 1 ) SO , 

Ls 1 , 4427 •LOG ( J ) ♦ . 5  »  L0621J) 

CALL  M*lFFT(Xx»YY,C0»L»-1) 

Lsu/241 
DO  3  1=1, L 

3  XA(I)SF/(XX(I)**24YY(I)««2) 

RETURN 

End 


J  -5 
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JNt'Ul  *>ATA; 


-.b39570964Qi 

.113*0011*32 

.15902415*02 

. 74923698*01 

-•£177loU*0i 

.o7404896*01 

.20620527*02 

.*9559656*02 

-.2S©oou*S+0* 

- . *  1166625*0* 

-.50993711*02 

.25941067*01 

.5<*h01QSJ+0* 

.ou074433*C 2 

-. 725*2098*01 

-.433*8869*02 

-,J9t>4b©76*0* 

.57681677*31 

.46871204*02 

.60260199*02 

-.59391618*02 

-.52051206*02 

-.37*37309*32 

*2 So 5 

.*3656428*02 

.42103511*02 

.237*6203*02 

-.iaj7WJ9*Q  2. 

-.*0384513*02 

-.35354962*02 

-.15176634*02 

.  .16701  j7WQa: 

.*7343318*02 

.337848*5*02 

.*3979633*01 

-  ,56b7i^62»Q^ 

-.50217300*02 

-.18068573*02 

. 2234*4*1*02 

.2106  7912*02 

-.11739*32*02 

-.52019095*02 

-«  I795ou*,Q^C^ 

■  *-6C  7960*01 

.17644*500*02 

.*6641572*02 

. u  785o*  77*01 

-.1*015275*02 

-,3243b875*u2 

•  •  2  **  d  *}*.  a.  o  0  ♦  Ox 

- • *o7562c2*CG 

.23166370*02 

.35915092*02 

.  l«79l,iS7^0J 

-  * . . 0b9*7**C2 

-.28198744*02 

jU79*47*J£. 

- .  lol  Qou'ji^Ol 

• l*3Coo3**C2 

. *899*507*02 

. *921*26**j2 

- . 1oo3*98**C* 

-.  35072288*02 

-. 5816*2*3*02 

6li02J07i40£ 

.-8910*09*02 

.8*1 12801, 6? 

.-*7700*3*02 

-«  5oo0uj>50^C^ 

-.aJ-S/312*u2 

-.57823633*02 

-.116117-7*53 

*oi»«*0372o-»0^ 

•  a33e03*4*52 

.60*09774*01 

-.5297u:0*C2 

-t>N  1  .11733513*30 

il-i.uAHa  -Lt.oi'lO..  s  ,j506o007*02 


-.227271*J*01 
. 10603774*01 
.38769337*02 
-.61963478*07 
.39106104*02 
-.33956462*01 
-.36984612*01 
. 1 3645936*02 
-.34617333*02 
.46678704*02 
-.37151017*02 
.29231878*02 
-.376iesfs*j2 
.316*5702*02 
-.19231728*02 
•  13325086*02 
-.21167069*02 
4.37641*48*01 
.3  9241 863*  J2 
-.79246433*02 


40687  8  4 

6„Ll,ltTl<e.  91*064  CutFMCa.ih; 

,27j3tl51*01  -,37?23tl8*;i 

l-6wiOUCr<l-A(6j  .**/  -  .689565*7-13 

f.0n6A*12t^  £4j.T  1  *0.  5028 F  *C  lt'*l 5  1 

.7330*390*02  .81104860-01 

-.23157.07*00  .49061750*00 


.*0436127*7! 


-.59619*89*70 

.76837693*27 


-.43577752*00 


-.43001970*00 
.  77  6  j* 70  7* 


-.77701027*00 

.  36.'43c2-»00 


BAD  DATA  POINTS  (SUBSECTION  5.2) 


c  spectral  estimation  for  bao  data  points  user;  change 
C  LINE  17  and  replace  LINES  22-36  AN0  41-46 
C  N  S  number  of  data  POINTS 
C  X(1),...,X(N)  s  INPUT  DATA 

c  bm*x  s  maximum  number  of  bad  data  points 

C  B  s  ACTUAL  NUMBER  OF  8A0  OaTA  POINTS  (MUST  HAVE  B.lE.BMax) 

C  M(l) , , , , iM(B)  s  LOCATIONS  OF  BAO  OATA  POINTS 
C  PMAX  5  MAXIMUM  orcer  OF  FILTER 
C  PBeST  s  BEST  OROER  OF  FILTER 

C  AID . * (PBEST)  s  PREDICTIVE  FILTER  COEFFICIENTS 

C  PRoO  s  PROOuCT ( 1-A (P ) **2)  pOR  psl  TO  PBEST 
C  RHOU),.,.,RNO(PMAX)  =  NORMALIZED  CORRELATION  COEFFICIENTS 
C  J  s  SIZE  OF  FFT  (MUST  BE  A  P0»ER  OF  2) 

C  XX(1)*.,,,XXIU/2U)  s  FRACTIONAL  POWERS'  FROM  DC  TO  NYOUIST  FREOuEnCY 
C  CQ<i)#...,C0l>J/,**I)  =  QUARTER  COSINE  TABLE 
C  Y*  Yr»  ANO  IP  are  required  AUXILIARY  arrays 

Parameter  ns  ioo*  bmaxs  25»  pmax=iq,  Ds2048»  uvisj/q-u 
InTESER  8*PBEsT 

OIMENSION  X(N) , r(N) ,A(PmAX) ,RHO(PMAX) *XX(J) »TY(J) i CO ( J41 ) 

OIMENSIOn  M(BMAX) • IP(N) 

C  INPUT  OATA  IN  X <  1 . . X (N) 

DEFINE  IRAN0=I»5«*I59.{  a-SISN(i,I*5**I5;)/2)*3435973B367 
DEF INE  RANOsFLOAT ( l ) /3435973B367. 

1=6281 

NS7aHT=n*400  .  WILL  DISCARD  INITIAL  400  POINTS 
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XX(1)=0. 

XX(2)=0. 

XX(3)=0, 

XX14)=0. 

LO  11  L=S,NSTAkT 
IslRAND 

XX(L)S2, 7b07*XA(L-l)"-3,al0b*XX(L_21w2,653b*XXlL-3)- 
SQ .923S*Xa IL-4 ) +T INORM (kaND,S11) 

11  CONTINUE 

CO  12  1=1, N 

12  XU)=XX(X+NSTaRT-N) 

PRINT  1, 

i  fokmau/i  input  data:') 

PhInT  4,  (X ( I ) , 1=1 ,N) 

t  EutEk  b,  ano  enter  bad  oata  locations  in  mid  »...#m(B) 

B=S 

N(l)=3 
K(2)=7 
Mi3)=ll 
M4)  =  l 2 

M  =  )=1S 

o  evaluate  predictive  filter  coefficients 

Call  BuRG6D(N,PMAX»X,H»M* IP,Y,PBEST.A,PROD»RHO) 
Print  9,  x(n) 

9  FORMAT  1/  '  MEAN  ='  iEm,b) 

PRINT  10,  T(N) 

10  FORMATC  STANDARD  DEVIATION  =',£13*81 
PRINT  2,  PBEST 

*  FORMAT (/  •  PBEST  =*>13) 

PRINT  3, 

j  FORMAT ( / •  PREOiCTIVE  FIlTER  COEFFICIENTS! ' > 

PRINT  4,  (All), 1=1, PBEST) 
h  FOrmaT  15E20.B) 

FrInT  5,  PKOu 

s  Furmat 1/ •  product u-A(P)**2)  ='»E13.B) 

PRINT  b, 

b  FORMAT  l/»  No«MALUtO  CORRELATION  COEFFICIENTS:') 

Print  **,  <khO(d ,i=i,pmax> 
call  QTRCOS(CO* J) 
c  evaluate  fractional  powers 

call  POWERS (PBEST, A, PROD'U, XX, YY> CO) 

Print  7, 

7  FORMAT (/•  FRACTIONAL  POWERS!') 

Lsu/2+1 

Print  b,  (XXII) ,I=1,L) 
a  FoKMAT(2x,10El3,6) 

EnD 


Subroutine  burgbo(n,pmax*x,b,m»ip,y,pbest,a»prod,rho)  w  2  feb  1976 
l  this  subroutine  computes  the  predictive  Kilter  coefficients  for  b  bad  points 

C  N  =  NUMBER  of  data  POINTS  I  INTEGER  INPUT 

c  PMAX  =  MAXIMUM  ORQE8  OF  FIlTER)  INTEGER  INPUT 

C  Xll) ,X(2) , ,,.|X(N)  =  DATA  ARRAY  ON  INPUT)  ALTERED  ON  OUTPUT 

c  on  output,  mi  ,x(2> ,...,X(Pmax)  =  auipmax>,A(2)Pmax) . . 

C  B  s  NUMBER  OF  bad  OATA  POINTS)  INTEGER  INPUT 

C  MU)  ,M(2) ,  .,,,M(B)  =  LOCATIONS  OF  BAD  DAYa  POINTS)  INTEGER  INPUTS 
C  THESE  LOCATIONS  MUST  @£  OISTlNCT  AND  LIE  IN  THE  RANGE  C1,N3 
L  IPiD*XP(2>,...,IP(N)  =  AUXILIARY  ARRAY)  SCRATCH  INPUT 

c  tu),t(2),.,.,t(n)  =  auxiliary  array?  scratch  input 

cn  output,  t(1)»y(2)»...,Y(Pmax)  =  au)1>,a<2)2)»...,a<pmax)pmaxi 
-  cn  output,  *<n>  s  mean,  ano  t<n)  =  standard  deviation  of  input  data 

.  PbCST  s  BEST  ORDER  OF  FILTER)  INTEGER  OUTPUT 

*Ui.A(2>....,AlPeEST)  =  PREOICTIVE  FILTER  COEFFICIENT  ARRaY  = 

»  4<t»B£ST),Ai2>P8EST>....,A(PBEST)PB£ST)l  OUTPUT 
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C  PROD  =  PR00UCT(1-A(PIPbEST)*»2>  POR  P*1  TO  PBESTl  OUTPUT 

C  MHO . . . RHO(PMAX)  =  NORMALIZED  CORRELATION  COEFFICIENTS!  OUTPUT 

C  DIMENSION  X(N) ,Y(NJ *A<PMAX) »RH0<PMAX)  IS  REOUIREO  IN  MAIN  PROGRAM  • 
C  DIMENSION  M(6MAX) »IP(N)  IS  REQUIRED  IN  MAIN  PROGRAM 

integer  pmax»b»p6£St»Pibp 

DOUBLE  PRECISION  SA,SB 

dimension  X(1)»mu),ip<i>#y(1)>a<u,rmo<i> 

IF(B.GT.O)  GO  TO  21 

call  burg  in , pmax *  x *  y  * pbest , a # proo , rhoy 

RtTURN 

21  LsN-8 

IP (PMAX ,GT ,3i*SQRT (L) )  PRINT  2.  PMAX.L 

2  FORMAT!/?  PMAX  ='*I4#'  IS  TOO  LARGE  FOR  NUMBER  OF  3000  DATA  POINTS 
*  n-B  =',I5J 

C  SET  UP  IP  ARRAY  FOR  P=0»  EQ.  173 
DO  22  1=1, N 

22  IP(I)=1 

Do  23  L=1»B 
I=M (L) 

23  IPII)=0 

C  COMPOTE  MEAN  OF  GOOD  UATA  POINTS 
Sl=0. 

DO  1  1=1, N 

IFlIPUi.EQ.O)  GO  TO  1 
Si=Sl*X<I) 

1  CONTINUE 

Sl=Sl/(N«Bl 

C  SUBTRACT  MEAN,  AnO  SCALE  TO  UNIT  VARIANCE,  FOR  GOOD  DATA  POINTS 
S2=0. 

DO  3  1=1, N 

IFlIPUl  ,EO.O)  GO  TO  3 
X(I)=XU1-S1 
S2=S2+X ( I) **2 

3  continue 

S2=SGRT(S2/(N-B-1.) ) 

T=l./S2 
Do  s  1=1, N 

IF(IPU).EG.O)  GO  TO  5 
X ( I ) =X ( I ) *T 
Y ( I ) =X ( I J 
CONTINUE 
BEGIN  RECURSION 
P=0 

PR00UC=1 , 

AICM1N=C, 

PoEST=0 
FRO(j=l. 

P=p*l 

oPOMfc  IP  ARRAY  I  EQ.  173 
DO  24  L=l,d 
I=MlL)*P 

IFII.GT.N)  GO  TO  24 
IP(I)=0 
,4  CONTINOE 

Bp=0 
L=P*1 

CO  25  l=t,N 
25  BP=BP4l-IP(D 

KSN-P-OP 

IFlK.LT.25i  Print  26,  K.P 

tb  Format i/ •  nomber  of  valiO  error  points  is  only*, 13.*  for  p=*»i3> 

C  CALCoLATE  CROSS-GAINI  eg,  i93 
SasO.OO 
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S8=0.00 

L=P+1 

00  7  I=L*N 

IFUPtlX.EG.O)  SO  TO  7 
Sa=SA+XU)*T(I-X) 

So=SB+X { I >  **2*. Y ( 1-1 ) **2 
7  CONTINUE 

0=2, *S A/SB 

PRODoCsPROOuC*  1 1 . -G*S ) 

c  CALCULATE  FILTER  COEFFICIENTS)  EOS,  1954196.  STORE  IN  . . . 

XvPjsG 

IF(P.EO.l)  GO  TO  6 
L=P/2 
00  9  1=1 ■ L 
1=A(I)-G*XIP-I) 

XiP-I)=X(P-I)-G*X(I) 

4  Xii)=T 

c  calculate  normalized  correlation  coefficient i  eq.  149 

6  T=A(P) 

IF(P.Ew.l)  GO  TO 
L=P-1 

00  15  1=1 * L 

15  T=TfX(i)*RMOiP-I) 

14  RHO(P)=T 

C  CAlCulaTE  AKAIkE«S  INFORMATION  CRITERION  EOS.  1944202 
KELERRs ( 1 . -G*G ) *SNGL  J  SB )  /  <  2  .  *K ) 

AlCsLOG ( KELERR ) +2 . *FLOAT  <P ) /*< 

IF(AIC.GE.AICMIN)  GO  TO  10 

A1CMIN=AIC 

PdEST=P 

PHOO=PROOUC 

00  11  I=1#P 

11  A(I)SXII) 

10  IF(P.EG.PMAX)  GO  TO  16 

C  UPDATE  FORWARD  AnD  BACKWARD  SEQUENCESl  Efl.191 
L=P+1 

00  12  I=N.L,-1 

IF  (IP(I) ,EU,0)  60  TO  12 

T=X(I)-G*T(I-1) 

TU)SY<I»11-G*X(I) 

X(1)ST 

12  Continue 
T(P)=G 
60  TO  6 

16  Y(PMAX)=G 

IF ( PoEST ,Eu ,PMAX )  GO  TO  4 
L  COMPUTE  extrapolated  normalized  correlation 
c  COEFFICIENTS  from  P0EST+1  TO  PMAXI  EQ.  165 
L=P3£ST+1 
Uo  17  p=l*pmax 
AiP)=0. 

T=0. 

DO  18  I=1«PBEST 

16  T=T*A(I)»RHO<P-I) 

17  Rnu (P) =T 

4  A l N) =S1 

Y (N)=S2 

RETURN 

EnU 
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SU8R0UT : NE  PO WER  S ( P8E  S  T  #  A , PR  00 » J  #  XX » Y Y . C  0 } 

C  THIS  SUBROUTINE  COMPXffEfi  THE  FRACTIONAL  POWERS  IN  BANOS  1/(J*0ELTA) »  EQ,  J-4 
C  PBEST  s  BEST  ORDER  OF  FILTER*  INTEGER  INPUT 

C  A ( 1 . . .  =  FILTER  COEFFICIENT  ARRAY*  INPUT 

C  PROD  s  PRODUCT U-A(P)**i»)  FOR  P=1  TO  PBEST*  INPUT 
C  U  s  SIZE  OF  FFT  <J/2*1SNUM8£R  OF  FREQUENCY  POINTS)*  INTEGER  INPUT 
C  XX  S  AUXILIARY  ARRAY  ON  INPUT 

C  XX ( 1 ) f , , , > XX (J/2+1 )  s  FRACTIONAL  POWERS  ON  OUTPUT 
C  YY  s  AUXILIARY  ARRAY*  SCRATCH  INPUT 

C  C0(1) »,,.*C0(J/4+l>  s  QUARTER  COSINE  TABLE  FOR  FFT*  INPUT 
C  DIMENSION  XX (j) # YY ( J) iCO( J/4+1)  IS  REQUIRED  IN  MAIN  PROGRAM 
C  DIMENSION  A(PMAX)  IS  REQUIRED  IN  MAIN  PROGRAM#  WHERE  PMAX.GE, PBEST 
INTEGER  PBEST 

DIMENSION  A(l) #XX(1) #YY(1) #C0(1) 

F=PR0Q*2,/J 

XXUJSI, 

YY(l)sO, 

DO  1  Isl»PBEST 
XXa+l)=-A(I) 

1  YY ( 1+1 ) =0 • 

L=P8EST-#2 
DO  2  I=L>U 

XX<I)=0, 

2  YY(I)=0, 

L=I.4427*L0G(J)>.5  L0G2 ( J) 

CALL  MKLFFt(XX»YY,C0#L,-1) 

L=U/2*1 
DO  3  1=1. L 

3  XX ( I )=F/(XX( I )**2*YY ( I ) **2) 

RETURN 

ENO 
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FORTRAN  Program  for 
Multivariate  Linear 
Predictive  Spectral 
Analysis,  Employing 
Forward  and  Backward 

Averaging 


Albert  H.  Nuttall 

ABSTRACT 


A  FORTRAN  program  for  multivariate  linear  predictive 
spectral  analysis,  employing  forward  and  backward 
averaging,  is  presented.  The  program  is  written  for 
general  M,  where  M  is  the  number  of  processes,  with 
the  exception  of  an  internal  function  and  three  in¬ 
ternal  subroutines  which  are  written  for  M  =  2  in 
this  version  of  the  program,  but  can  easily  be 
generalized  to  general  M.  This  program  generalizes 
Burg's  algorithm  to  the  multivariate  case.  The 
theory  behind  this  program  will  be  published  in  a 
forthcoming  NUSC  Technical  Report. 
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FORTRAN  PROGRAM  FOR  MULTIVARIATE  LINEAR 
PREDICTIVE  SPECTRAL  ANALYSIS,  EMPLOYING 
FORWARD  AND  BACKWARD  AVERAGING 


INTRODUCTION 


The  Burg  algorithm  for  spectral  analysis  has  proven  to  be  a  very  attractive 
method  for  a  univariate  process.  *  Extension  to  a  multivariate  process  has  been 
desired  for  some  time  and  has  now  been  accomplished,  and  is  documented  here 
in  the  form  of  a  FORTRAN  program.  Publication  of  this  program  will  make 
immediately  available  to  those  interested  users  a  powerful  method  of  spectral 
analysis;  the  theory  behind  this  program  will  be  published  soon  in  a  NUSC 
Technical  Report.  The  basic  analytical  problem  was  to  minimize  the  sum  of 
the  traces  of  the  forward  and  backward  error  matrices  by  choice  of  the  partial 
correlation  coefficients,  subject  to  a  linear  matrix  constraint  which  guaranteed 
that  the  forward-extrapolated  and  backward-extrapolated  correlation  estimates 
were  Hermitians  of  each  other.  Solution  of  a  bilinear  matrix  equation  is  required 
in  the  process. 

It  has  just  come  to  the  author's  attention  that  a  similar  procedure  has  been 
presented  by  R.  H.  Jones. ^  Comparison  of  the  details  of  the  two  procedures 
and  programs  has  not  been  undertaken  yet. 


*A.  H.  Nuttall,  "Spectral  Analysis  of  a  Univariate  Process  with  Bad  Data 
Points,  via  Maximum  Entropy  and  Linear  Predictive  Techniques, "  NUSC 
Technical  Report  5303,  26  March  1976. 

*R.  H.  Jones,  "Multivariate  Maximum  Entropy  Spectral  Analysis, "  Applied 
Time  Series  Analysis  Symposium,  Tulsa,  Oklahoma,  14-15  May  1976. 
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Multivariate  Linear 
Predictive  Spectral 
Analysis  Employing 
Weighted  Forward  and 
Backward  Averaging: 
A  Generalization  of 
Burg’s  Algorithm 

Albert  H.  Nuttall 


ABSTRACT 


A  method  for  multivariate  linear  predictive  spectral 
analysis,  employing  weighted  forward  and  backward 
averaging.  Is  presented  and  programmed  in  FORTRAN. 
The  method  constitutes  a  generalization  of  Burg's 
univariate  algorithm  to  the  multivariate 'case.  The 
essential  analytical  procedure  is  to  minimize  the 
trace  of  the  sum  of  the  weighted  forward  and  back¬ 
ward  error  matrices  by  choice  of  the  partial  corre¬ 
lation  coefficients,  subject  to  a  linear  matrix 
constraint  which  guarantees  that  the  forward-extra¬ 
polated  and  backward-extrapolated  correlation  matrix 
estimates  are  Hermitians  of  each  other.  The  choice 
of  error  weighting  is  Important  and  Is  discussed. 
Solution  of  a  bilinear  matrix  equation  is  required 
in  the  algorithm. 
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INTRODUCTION 

Spectral  analysis  of  stationary  random  processes  via  linear  predictive, 
maximum  entropy,  and  autoregressive  techniques  has  attracted  much  attention 
lately,  especially  for  short  data  segments;  see,  for  example,  the  biblio¬ 
graphies  listed  In  referencesl,  2,  and  3.  For  a  univariate  process.  It 
appears  that  the  Burg  algorithm  (Ref.  4),  which  guarantees  a  stable  correla¬ 
tion  recursion.  Is  as  good  as  any  of  the  currently  available  techniques  of 
similar  nature  that  employ  an  all -pole  model  of  the  available  process  (Ref.  3). 

Accordingly,  It  Is  desirable  to  develop  a  spectral  analysis  technique 
for  the  multivariate  case  In  such  a  way  that:  we  employ  a  physically  mean¬ 
ingful  error  minimization  for  the  determination  of  the  filter  coefficients; 
the  technique  yields  a  stable  correlation  recursion;  and  it  reduces  to  Burg's 
algorithm  for  the  univariate  case.  It  will  be  shown  In  the  following  that 
we  have  accomplished  these  goals,  with  the  exception  that  we  have  not  proved 
(or  disproved)  the  stability  requirement.  A  FORTRAN  program  for  this  spectral 
analysis  technique  was  published  In  Ref.  5,  along  with  an  example  of  Its 
application.  Virtually  simultaneously,  the  same  technique  was  Investigated 
Independently  and  published  In  Ref.  6.  In  this  report,  we  will  document  the 
derivations  and  equations  that  lead  to  the  program  presented  In  Ref.  5,  and 
Indicate  an  extension  of  that  result. 

Our  approach  In  this  report  will  be  to  Investigate,  In  some  detail,  first 
the  case  where  the  correlation  of  the  multivariate  process  under  consideration 
Is  known  for  a  limited  range  of  argument  values,  and  to  extract  all  the 
relevant  Important  properties  of  the  solution  so  that  they  may  be  forced  to 
be  satisfied  later  when  we  treat  the  unknown  correlation  case.  This  property- 
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extraction  procedure  Mill  be  found  to:  furnish  guides  to  the  analysis  of 
the  unknown  correlation  case;  allow  us  to  cut  down  on  computer  execution 
time  and  storage  by  employing  the  properties;  and  make  us  aware  of  some  of 
the  shortcomings  of  the  unknown  (versus  known)  correlation  cases.  This 
procedure  should  also  be  helpful  to  those  who  are  not  thoroughly  familiar 
with  spectral  analysis  of  multivariate  processes  and  their  properties. 

Throughout  this  report,  we  assume  we  are  dealing  with  equi spaced 
samples  of  a  stationary  zero-mean  complex  random  process  X(t)  of  dimension¬ 
ality  M;  that  is,  sample 

xw*x  •Oe-'tff  (') 

is  an  M  x  1  column  matrix,  where  A  is  the  common  sampling  interval  for  all 
the  component  processes  of  X(t).  It  Is  not  assumed  that  X ( t)  is  Gaussian. 

In  section  2,  we  will  assume  that  the  correlation  matrix  of  process 
{Xn},  namely  the  M  x  M  matrix* 

K.  *  x„  xJi  *  *r"  (2) 

is  known  exactly  for  a  limited  range  of  values  of  k,  and  will  show  how 
an  approximation  for  the  spectrum  of  process  (Xn)  can  be  obtained.  In 
section  3,  the  input  correlation  matrix  will  be  unknown,  and  all  that 

Is  available  is  a  finite  set  of  N  data  samples,  X,,  x2 . XN,  from 

which  an  estimate  of  the  spectrum  of  process  {Xn}  is  desired.  The  end 
result  will  be  a  FORTRAN  program  for  multivariate  spectral  analysis. 


*The  case  of  complex  samples  is  treated  so  that  we  can  handle  complex 
envelope  or  complex  demodulated  processes.  Specialization  to  real  processes 
is  immediate,  and  (2)  becomes  An  overbar  indicates  an  ensemble 

average,  superscript  T  denotes  a  transpose,  and  superscript  H  denotes  a 
conjugate  transpose.  Matrices  are  indicated  by  capital  letters. 
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2.  KNOWN  CORRELATION 


If  the  correlation  in  (2)  is  known  for  all  k,  the  standard  (double¬ 
sided)  definition  of  the  spectrum  of  process  (Xn)  is 

etfi  *  *  r,  ,  (fi-ir  ■  <3) 

The  complex  M  x  M  matrix  G(f)  is  Hermitian  and  non-negative  definite  for 
any  value  of  frequency  f  (see  appendix  A),  but  need  not  be  even  in  frequency 
f.  When  Rk  is  not  known  for  all  k,  but  only  for  a  range  |k)sp,  an  approxi¬ 
mation  to  (3)  must  be  accepted;  this  problem  will  be  pursued  below. 

2.1  DERIVATION  OF  EQUATIONS 

Suppose  M-dimensional  samples  Xk_p,  •••»  are  available,  and  we 
attempt  a  one-step  linear  prediction  of  Xk  according  to  the  p-th  order 
operation 

X,  *  .  <«> 

where  complex  coefficient  matrix  An  is  M  x  M,  n  *  1 ,  2,  ....  p.  The 
instantaneous  error  at  time  kA  is  defined  as 

X*  X.-X,  *-  ,  A*-r-  (5) 

The  linear  operators  in  (4)  and  (5)  constitute  stable  linear  filters  regard¬ 
less  of  the  choice  of  coefficients;  the  filter  of  (4)  is  called  the  predictive 
filter,  that  of  (5)  is  called  the  predictive  error  filter.  Notice  that  we  are 
not  assuming  that  process  (Xn)  actually  satisfies  an  autoregressive  relation; 
rather  we  are  simply  attempting  to  linearly  predict  (Xn }  on  the  basis  of  the 
most  recent  p  past  values. 
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The  minimum  value  of  the  scalar  error 

p 

by  choice  of  coefficients  {An>lt  is  given  (in  appendix  B)  by  the  solution 
of  the  linear  matrix  equations 

where  the  explicit  dependence  on  the  order  p  is  indicated.  Knowledge  of 
Rk  for  )k|* f  is  required  in  (7). 


H  f 

Before  we  discuss  the  solution  of  (7)  for  {A^}(  ,  we  consider  one-step 

linear  "backward  prediction"  of  process  { Xn } .  Suppose  samples  Xk,  Xk_i . 

xk-p+l  are  availab^e»  and  we  attempt  a  one-step  linear  prediction  of  Xk_p 
according  to 


SB,  X 


(8) 


The  instantaneous  error  is  defined  as 


.  V-r  (9 

The  minimum  value  of  the  scalar  error 

11 

by  choice  of  coefficients  {Bn}?,  may  be  shown  (in  a  manner  similar  to  that 
of  appendix  B)  to  be  given  by  the  solution  of  the  linear  matrix  equations 

,  I4  **  r-  (1 

mrn  I  ^  ' 

For  the  optimum  coefficients  in  (7)  and  (11),  we  find  (see  appendix  B) 
that  the  optimum  error  matrices  take  the  form 


4 


TR  5501 


ft  xv;  =  K-  £a!X  *  UP  ,  U»*, 

ft£zr  .«,-±bTr.*  Vr ,  \fc«x..  (12) 

In  general,  these  two  matrices,  their  diagonal  elements,  and  their  traces  are 
unequal  (as  the  simple  example  of  p=l  will  show).  However,  their  determinants 
are  equal,  as  will  be  shown  in  subsection  2.2. 


The  solutions  of  (7)  and  (11)  can  be  accomplished  simultaneously  in  a 
recursive  fashion  (Ref.  7).  Define 


,  bT-x. 

(13) 

Then 

and 

<*cr,vr:; 

,  er-  u;: 

(14) 

|  1 t  n  s  )>->  (p  *  *)  • 

1 

(15) 

These  relations  will  be  simplified  somewhat  in  subsection  2.2.  For  M=1 , 
a  univariate  process,  (7)  and  (11)  immediately  yield 

tf-D?  U  M-l, 


where  we  have  used  (2)  in  the  form  R,  *  R*  for  a  univariate  process.  No 

k  -k 

such  simple  relation  as  (16)  holds  for  M  >  2. 


We  will  now  derive  a  chain  interpretation  of  the  above  results  that  will 
prove  very  useful  later  when  we  have  to  deal  with  the  unknown  correlation  case. 
For  the  optimum  filter  coefficients  and  the  p-th 

order  forward  and  backward  residuals  (see  (5)  and  (9))  as  the  outputs  of  the 
forward  and  backward  predictive  error  filters: 
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xT-iAr  . 

zT  •  -  ±  sf  5U.  -  x*,-  bX„  ~  •  - 1 X, 


*so  -r 

Then  using  (15),  we  can  express 


07) 


And  similarly 


^  v|a!X-aX, 

zr-  v  -  bX 

.zr^|Arx, -zT’-bx. 

^p)  .  -7<p) 


(18) 


(19) 


V^'P  “7  t-PJ 

k  and£^  are  re1ated  to  the(p-l)th  order  residuals 
simply  through  the  coefficients  and  6f.  A  block  diagram  of  the  relation¬ 
ships  in  (18)  and  (19)  is  given  in  figure  1,  where  2*!l denotes  an  M  x  M  matrix 
filter  of  unit  delay. 
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(O)  (o) 

Y  =  2  =  X, 


Figure  1.  Chain  Representation  of  Residuals 
Thus  matrix  operators  and  can  be  interpreted  as  those  coefficients 

r*r 


which  minimize 


if  -j  Hfzr 


(20) 


respectively,  at  the  output  of  the  p-th  stage  in  figure  1,  where^]^"  and 

are  determined  by  minimizations  at  lower  order  stages.  A**  and 
K  m  *\  P  P 

are  called  the  partial  correlation  coefficients.  Stated  alternatively,  stage 
by  stage  minimizations  of  (20),  via  choices  of  partial  correlation  coefficients 
%  and  ^  respectively,  results  in  the  same  overall  filter  as  if  the  powers 
in  p 

-  QhJ  (21) 


1»H 


were  minimized  by  the  choices  of  (An>?  and  {Bn>? ,  respectively,  each  in  one 
simultaneous  optimization.  This  will  furnish  an  important  reference  point 
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for  the  unknown  correlation  case  in  section  3. 

If  we  let  the  transfer  functions  (z- transforms)  to  the  outputs  of  the 
p-th  stage  in  figure  1  be  denoted  by  and  it  immediately 

n  9 

follows,  from  figure  1  or  equations  (18)  and  (19),  that 


nr**  -  , 

^w-b^w, 


=  X 


In  closed  form,  these  predictive  error  filter  transfer  functions  are  express- 
ible  as  (see  (17))  »  .  £  e~A*  .  j  .  ±£'f 

n  "«•  W-l  "  ) 

w  •  - 

9  n  ■  o  ^ 

J  =  0  '  ^ 

(2: 

L-  w*l  "  J  . 

2.2  PROPERTIES  AND  INTERPRETATIONS 

Suppose  that  process  {Xn}  were  scaled  according  to 

X*  -  t>  X.  l2‘ 

where  M  x  M  matrix  D  is  arbitrary,  but  invertible,  Then  the  correlation  of 
the  scaled  process  is 

t  *  x„  X„"  -DXXD^  XR.Xh.  (2! 

Now  from  (7),  since  the  solutions  u:)  and  sen  must  satisfy 

is  (2f 

Its)  •  )  •  * 
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iLAfLrt ,  's"sr> 


(27) 


respectively,  the  solutions  are  related  by  a  similarity  transformation: 

This  is  called  the  scaling  property.  A  similar  property  holds  for  the  back¬ 
ward  coefficients  fan. 

An  immediate  by-product  of  the  scaling  property  is  that  A^  and  A^  have 


the  same  eigenvalues: 


<Ut(£T-  XT)  *  detMTlf'- Xt)  *  de-t^T-Xx) . 


(29) 


Similarly,  B^*  and  have  the  same  eigenvalues,  regardless  of  scaling  matrix 
n  n 

D. 


The  remainder  of  this  subsection  will  deal  with  the  quantities  Up  and 
Vp  defined  in  (12),  and  Cp  and  Dp  defined  in  (13).  The  quantity  Up  can  be 
interpreted  physically  as  the  correlation  matrix  of  the  p-th  order  forward 
residual;  see  (12),  (5),  and  (17).  Similarly,  Vp  is  the  correlation  matrix 
of  the  p-th  order  backward  residual;  see  (12),  (9),  and  (17).  That  is, 


or  -  y?  V"  ,  %  -  . 


Thus  U  and  V  are  Hermitian: 
P  P 


Uf-VV  ;  v;=Vr  ; 


(30) 


(31) 


and  Up  and  Vp  are  non-negative  definite: 

I  I  A  1  /N  /H  V  ill  «.  H 


>  o 


(32) 


for  any  M  x  1  matrix  .  In  appendix  C,  it  is  shown  that  simple  recur¬ 


sions  hold  for  Up  and  Vp: 
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(33) 


o,  ■  (i-  ,  u-*., 

V(i-**)V.,  v-*- 

It  immediately  follows  from  (33)  that  (see  appendix  C) 

=  dei  Vf  ,  f  *  0. 

This  property  was  proved  in  Ref.  8,  page  240. 


The  quantity  Cp  defined  in  (13)  can  oe  interpreted  as  the  cross-corre¬ 
lation  matrix  between  the  p-th  order  forward  and  backward  residuals  at  one 
unit  of  delay  “  p  p  “  ■  -  u 


(34) 


K  —K-l  H*0 

where  we  have  used  (17),  (2),  (7),  and  (13),  Similarly 

iH 


(35) 


(36) 


K«0  '  *»*© 

where  we  have  used  (17),  (2),  (11),  and  (13).  It  imnediately  follows  fron 


(35)  and  (36)  that 


DrH  -  Cr  . 


(37) 


Thus  it  is  not  necessary  to  do  the  additional  calculation  of  Dp  in  the 
solution  given  in  (13) . 

Another  interpretation  of  Cp  is  available  as  follows: 

I  Cf,  *=>•*-' 

where  we  have  employed  (17),  (2),  (12),  (7),  and  (13)  in  order.  Thus  the 
p-th  order  forward  residual  is  uncorrelated  with  the  p  most  recent  past 


(38) 
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values  of  the  input,  and  the  crosscorrelation  at  p  ♦  1  units  of  delay  is 
just  Cp.  Similarly,  the  backward  residual  satisfies 


/  Dp, 


Yet  another  interpretation  of  Cp  and  Dp  will  be  given  in  subsection  2.3. 

As  the  order  p  in  the  linear  prediction  (4)  increases,  (38)  yields 

e-TC-lVl  “  f— 

[0,  ■»« 

Therefore  the  autocorrelation  matrix  of  the  forward  residual  becomes 

-  p--  ... 

That  is,  p-th  order  residual  yJ* tends  to  white  noise  with  a  correlation 
matrix  at  zero  time  delay  of  value  U,,,  which  Is  not  necessarily  diagonal. 

The  Hermitian  property  in  (37)  allows  us  to  combine  (14)  into  the 


equation 


^  > 
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where  we  utilized  (31).  This  constraint  on  the  partial  correlation  coeffi¬ 
cients  will  be  of  paramount  importance  in  the  unknown  correlation  case.  It 
Immediately  follows  from  (42)  and  (34)  that 

A**  =  (l.t $)* 

No  such  simple  relation  holds  between  det  Ayf  and  det  for  np,  except 
for  M  *  1,  a  univariate  process. 

2.3  EXTRAPOLATION  OF  CORRELATION  VALUES 

In  subsection  2.1,  we  minimized  the  er»-or  in  prediction  '4'  one  fo^nc 
tnat  for  a  p-th  order  prediction,  knowledge  of  R^  for  |k|»p  *as  reduced; 

see  ;7 Now  suppose  that  this  is  all  the  knowledge  avai’at'e  %  . 

that  is,  suppose  Rfc  is  unknown  ‘or  k  -p  What  can  pe  done  acOu*.  a.-.*-:*  'V 
these  unknown  values? 

One  approach  is  as  follows:  we  assume  that  the  p-th  order  residua' 
process  {yf}  ln  i!7)  is  white  (i.e.,  uncorrelated  for  a:*  ncr - ;er'_  oe'a.s  an 
that  ^’*0;  otherwise  we  could  reduce  the  value  of  p  .  "hat  is.  we  «e 

can  do  nothing  more  in  prediction  by  choosing  more  terms  m  the  Sur  4  . 
which  is  tantamount  to  assuming  maximum  uncertainty  (entropy1  a&Out  tnt 
residual  process  This  is  a  very  extensive  assumption,  we  no*  '"vest'  - 

gate  its  ramifications. 

we  know  krom  38'  that 
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£  *  Txl  *  -  ,  «ii  *.  1441 


must  satisfy 


TjT  s  0  for  U  m  if. 


(45) 


Additionally,  enploying  (17),  the  autocorrelation  matrix  of  the  p-th  order 
residual  is 

(46) 

*  *-J  m*0  *  *»*0  j  J 

Now  for  j  •  1 ,  the  whlu  noise  assumption  on  process  {^yields,  via  (46) 

and  (45). 

c-  ®- 

And  for  j  *  2,  the  white  assumption  (In  conjunction  with  (47))  yields 

£'t> 


(47) 


(48) 


Continuing  In  this  way,  the  white  assumption  Is  tantamount  to  assuming  that 

(49) 


FjJ*  *•  0  fcr 


)  S  *. 


Returning  to  expression  (44),  this  means  that  we  are  assuming  that 


that  Is, 


mao  ' 

Xm  »  ^  At*  *Rr  ,  p+)  S  m 


(50) 

(51) 


Using  more  explicit  notation,  and  denoting  these  assumed  values  of  correlation 
as  forward  extrapolations  ,  we  have 
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^!T  *  ,  p+i  * 


where  “starting  values' 


qV  V 

f  0  5  *  S  p 


(52) 


;53 


Equation  ,52)  is  called  the  correlation  recursion  equation.  t  is  interesting 
to  note  tnat  the  ‘ora  of  the  correlation  recursion  .52)  is  dentical  to  the 
*orm  '4,  for  the  individually  predicted  wavefom  values. 

The  correlation  values  in  (52)  are  called  the  maximum  entropy  correlation 
extrapolations.  The  recursion  is  stable  if  and  only  if  (see  (23)) 

d.t  (r  -  i  x  *AD  * 

possesses  all  its  zeros  within  the  unit  circle  in  the  complex  z-plane. 
this  property  will  be  treated  in  subsection  2.4. 

A  similar  procedure  for  backward  correlation  extrapolation,  assuming 
that  residual  process  [j£]  is  white,  yields 


«» i  *  »  f 


•M  f  m  ; 


where 


»  R.. 


Cams 


(56) 


14 


TR  5501 


Backward  recursion  (55)  Is  Identical  In  form  to  the  backward  prediction  (8). 
The  recursion  (55)  is  stable  if  and  only  if  (see  (23)). 

*t(T-irtT).  (57J 

possesses  all  its  zeros  within  the  unit  circle. 

As  a  special  case  of  (52)  and  (53),  the  one-step  forward  extrapolated 
correlation  based  on  a  p-th  order  prediction  is 

But  from  (13),  we  now  can  see  that 

C  *-iL  /*1L_  *"R  -  V<f>  <»' 

v-r  rV'  V* 

That  is,  Cp  is  the  difference  between  the  true  correlation  value  Rp+i  and  the 
one-step  forward  extrapolated  correlation  R^  based  upon  knowledge  of  (Rk^p 

A  similar  procedure  shows  that 

Ty  *  R_,_,  -  (mi 

That  is,  Dp  is  the  difference  between  the  true  correlation  value  R_p_i  and 
the  one-step  backward  extrapolated  correlation  based  upon  knowledge 

of  (Rk)0 

When  (59)  and  (60)  are  combined  with  the  Hermitian  property  In  (37),  we 
see  that 


t?"  ■  V 
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This  Is  a  special  case  of  the  more  general  property  (demonstrated  in  appendix 


0)  that 


(62) 


that  is,  the  backward  and  forward  extrapolated  correlation  matrices  are 
Hermitians  of  each  other.  This  is  a  desirable  property  of  the  extrapolations 
and  is  consistent  with  the  same  property,  (2),  which  holds  for  the  known 
correlation  values, 

It  was  noted  in  (54)  and  (57)  that  the  zeros  of  detflf^(z)  and  det flf,(z"^) 
must  be  within  the  unit  circle  in  order  that  recursions  (52)  and  (55),  respect¬ 
ively,  be  stable.  It  is  shown  in  appendix  E  that 

J«Kt-±.-AT)-  <w(i-ii"B?M)  (63> 

•«  i  /  '  w»  t  /  • 

That  we  need  consider  only  the  zeros  of  one  of  these  quantities;  the  location 
of  these  zeros  is  considered  below. 


It  is  also  shown  in  appendix  E  that 

y »:  •  (t.  b :J 

m  a’;  •  b;/ 

2.4  SPECTRAL  APPROXIMATION 


.64) 


(65) 


Equations  (52)  and  (53)  define  the  forward  extrapolated  correlations  for 
all  m>0.  We  extend  these  to  negative  m  via 


*  *  0, 


(66) 
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which  Is  consistent  with  (2).  We  will  now  use  the  Fourier  transform  of  this 
Infinite  sequence,  as  In  (3),  as  an  approximation  to  the  spectrum  of  process 
{Xn>.  In  appendix  F,  it  is  shown  that  the  approximate  spectrum  is  given  by 


where 


(f  w  *  -  W  u,  W  ,  ffi  <  i 

HaW  *  -  *Xf(- 


(67) 


(68) 


Is  the  forward  predictive  error  filter  transfer  function.  Since  Up  is  non¬ 
negative  definite  by  (32),  spectral  approximation  G^(f)  is  nonnegative 
definite  for  any  f;  it  is  also  obviously  Hermitian  by  (31).  Thus  the  desir¬ 
able  properties  of  appendix  A  are  achieved  by  approximation  (67).  In  order 
to  evaluate  (67),  one  M  x  M  matrix  inverse  (of  (f))  is  needed  at  each 
value  of  f  of  interest. 


A  similar  procedure  applied  to  the  backward  correlation  recursion  of 
(55)  and  (56)  yields  the  spectral  approximation 

Gftf)  •  a H*(f)"  V,  H?(f m 

where  . 

«xp(-i  (70) 

is  the  backward  predictive  error  filter  transfer  function.  Since  the  extra¬ 
polated  correlations  via  (52)  or  (55)  are  equal,  as  shown  in  subsection  2.3, 
the  same  notation,  G^(f),  is  used  for  both  (67)  and  (69);  however,  we  have 
two  different  factorizations  for  the  unique  spectral  approximation  G^P^(f). 

In  appendix  F,  it  is  also  shown  that  the  zeros  of  det^(*)  .(see  (22) 
and  (23))  all  lie  inside  the  unit  circle  in  the  complex  z-plane.  Additionally, 
the  poles  of  all  lie  inside  the  unit  circle,  and  the  zeros  of^(j)-1 
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all  lie  at  z  *  0.  Thus  the  recursion  (52)  is  stable.  This  point  is 
discussed  in  Ref.  7,  p.  132. 

2.5  EXAMPLE 


A  simple  example  for  M  =  2  will  be  considered.  Let  the  process  be 


generated  according  to 


where 


<r  - 


.?5  -.15- 

.65"  .55- 


and  white  noise  Wk  satisfies 


XVC  .  S„  X. 


Then  it  may  be  shown  that 

K *  *■  x ,  »io, 

with  solution 


25  1 35 

x,  * 

n.ii? 

-12.0  ‘I'T 

4.162 

2 1.  643_ 

) 

non 

15  064_ 

By  means  of  (7)  and  (11),  we  find 


*<■> 


"iff 

-15 

.  55^30 

.  152-iR 

.65 

.5*- 

) 

-  64400 

.  *4020 

and  Af*  A? ,  A?5  0,  2  s  d  s  p  .  We  observe  4?  *  ifX  4  , 

and  V  $,*.  The  determinants  of  (76)  are  both  .955. 


Evaluation  of  (12)  gives 

H  •  i ,  V- 


2IA34  M?C5<* 


m 


/  „ 


:73 


(75) 


(76, 


(77) 
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These  matrices  and  their  traces  and  eigenvalues  are  unequal,  but  their 
determinants  are  both  1. 


3.  UNKNOWN  CORRELATION 


In  this  section,  the  correlation  values  {R^}  are  unknown,  and  the  only 

information  available  about  the  random  process  is  a  finite  set  of  N  data 

points  X  ,  X  ,  X.,,  from  which  we  have  removed  the  sample  mean.  From 

1  2  N 

these  N  data  points,  we  desire  an  estimate  of  the  spectrum  G(f).  But  we 
cannot  minimize  or  utilize  any  ensemble  averages  as  was  done  in  section  2, 
since  we  have  only  a  finite  segment  of  one  member  function  to  work  with. 

3.1  PHILOSOPHY  OF  APPROACH 


For  the  known  correlation  case  above,  we  had  the  set  of  normal  equations 


£a!X.  -  O 

I 

idX.  =  1L 


IS**  f> 


(78A) 

(78B) 


where  and  {B?}'  were  the  unknowns.  Now  in  the  unknown  correlation 

case,  we  make  a  change  by  assuming  that  and  are  known*  (along  with 

for  \m)s  p-l  ,  from  lower  order  solutions),  and  by  letting  1?f  andKf  be  unknown. 
The  equations  in  the  unknowns  are  still  linear,  and  the  solution  is  given  by 


1  f 


(79A) 

(79B) 


*The  manner  of 


specifying  and|£ 


will  be  considered  In  subsection  3.4. 
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(80A) 

V  • 

(80B) 

(It  must  be  noted  thatl^in  this  section  denotes  an  estimate  of  the  true 
(unknown)  correlation  value;  for  notational  convenience,  no  distinguishing 
symbol  has  been  added  to  1?.  to  emphasize  this  distinction.)  However,  we 
shall  insist  that  the  correlation  estimates  (80)  that  we  obtain  at  the  p-th 
stage  satisfy 


(81) 


in  keeping  with  property  (2).  Since  equations  (78)  and  (81)  are  identical 
to  those  encountered  in  the  known  correlation  case,  the  mathematical  defini¬ 
tions  and  interrelationships  employed  there  can  be  applied  here  also.  How¬ 
ever,  some  of  the  properties  and  physical  interpretations  may  be  different, 
since  we  are  now  dealing  with  estimates,  rather  than  true  values. 


(82) 


(83) 


To  solve  (78),  we  begin  by  defining 

Now  consider  p=l  in  (78);  we  have 

aX  -  %  ,  -  R-, . 

Now  if  and  8^’  are  known,  we  can  compute  unknowns  K,  and  R_, .  But  by 
constraint  (81),  and  must  be  chosen  such  that 

AfK  •  ■  (84) 

Thus  when  we  select  and  B?,  constraint  (84)  must  be  kept  in  mind;  that  is. 
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A?  and  $  cannot  be  specified  independently  of  each  other. 

At  stage  p(^2),  if  Ap  is  known  (and  are  known  from  earlier 

stages  with  property  0***  f-i ) ,  we  could  solve  the  linear  equations 

(78A)  for  iAt’lf  and  “Rj,,  according  to  (79A)  and  (80A),  where  the  lower  order 
quantities  in  (79)  and  (80)  are  available  from  earlier  stages.  Similarly 
if  is  known,  we  use  (79B)  and  (80B)  to  solve  (78B).  However,  by  (81), 

u  w 

we  must  constrain  the  selection  of  Ap  and  Bp  . 


To  see  exactly  what  constraint  (81)  implies  about  the  selection  of 
and  notice  that,  for  p>2,  (and  defining  -X) 

v£«  iv.-Sr-A?ov^ 


1 


(85) 


where  we  have  employed  (80A)  and  (79A).  Now  define  forward  extrapolated 
correlation  estimates  based  on  order  p-1  according  to  (see  (52)  and  (53)) 

if*  #aVC>  «*r. 


(86) 


M«  I 


where 


*  K  ,  o  *»*/>-'• 


(87) 


Then,  in  particular,  the  one-step  forward  extrapolated  correlation  estimate 
based  on  order  p-1  is 


if  =  5L/fi£  -  • 


(88) 


Also  define  (see  (12)) 
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r  '  n«o  ** 


(89) 


This  quantity  has  the  physical  interpretation  as  the  estimate  of  the  corre¬ 
lation  matrix  of  the  (p-l)th  order  backward  residual  at  zero  time  delay 
(see  (30));  its  properties  are  considered  in  subsection  3.3.  Then  by  means 
of  (88)  and  (89),  (85)  can  be  expressed  as 

*r  =  Aty., .  (90) 

(This  equation  is  similar  to  a  combination  of  (14)  and  (59)  for  the  known 
correlation  case. ) 

At  the  same  time,  by  (80B)  and  (798)  (and  defining^  =-l), 


(91) 


Now  define  backward  extrapolated  correlation  estimates  based  on  order  p-1 
as  (see  (55)  and  (56)) 

>-l 


it' ■ 


(92) 


where 


it0  s  "R  o  £  "  s 

-*w  -m  ) 


2  P~i 


(93) 


Then,  in  particular, 
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Also  define  (see  (12)) 


(95) 


This  quantity  is  an  estimate  of  the  correlation  matrix  of  the  (p-l)th  order 
forward  residual  at  zero  time  delay  (see  (30)).  Then  by  means  of  (94)  and 
(95),  (91)  can  be  expressed  as 


+  B*  Ur, 


(96) 


(This  equation  is  similar  to  a  combination  of  (14)  and  (60)  for  the  known 
correlation  case.)  But  now  it  can  be  shown  (see  appendix  G)  that  the 
extrapolated  correlation  estimates  in  (88)  and  (94)  satisfy 


(97) 


Therefore,  if  (81)  is  to  be  satisfied,  (90)  and  (96)  in  conjunction 
with  (97)  force 


(98) 


(This  reduces  to  (84)  for  p=l . )  Thus  the  selection  procedure  of  ^  and 
at  the  p-th  stage  must  be  done  according  to  (98),  where  Vp^  and  Up.,  are 
quantities  already  available  from  the  (p-l)th  stage,  according  to  (89)  and 
(95).  The  precise  selection  procedure  will  be  undertaken  in  subsection  3.4. 
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3.2  COMPARATIVE  FEATURES 

There  are  alternative  techniques  to  the  estimation  of  the  correlation 
matrices  and  the  spectral  density  matrix  that  could  be  considered.  For 
example,  the  standard  Yule-Walker  technique  (e.g.,  Ref  2,  page  186)  uses 
correlation  estimates 


(99) 


where  the  sum  is  over  all  nonzero  summands,  and  then  solves  recursively  for 
Vff,  and  {5^]^  via  The  method  in  subsection  2.1.  This  apriori  decision 
on  the  form  (99)  of  the  correlation  estimate  gives  poorer  spectral  estimates 
for  M=1  (Refs.  2  and  3),  and  probably  does  so  for  M>1.  The  estimated 
correlation  matrix  bu:  is  Hermitian,  block  Toeplitz,  and  nonnegative 
definite: 


r  i 


where  isMx|.  However  the  stability  of  the  correlation  recursion  (52) 
is  unknown  to  this  author.  The  estimate  (99)  is  unchanged  by  the  addition 
of  more  stages,  that  is,  larger  values  of  p. 
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Another  technique  would  be  to  minimize  the  prediction  error 

Y,  *  ,  p+1  S  k a  M  (4.--1)  noi) 

over  the  available  data  points  directly,  by  choice  of  .  We  have  the 

“  X YH  ,  ^ A  ^  ah 

(102) 


error  matrix 


-  ±a.\.  , 


where 


=  )  0  5  m>*  5  } 


(103) 


The  optimum  coefficients  for  minimum  trace  of  the  error  matrix,  (102),  are 
solutions  of 


(104) 


Matrix  [5m is  not  block  Toeplitz,  and  a  significant  computer  problem 
exists  for  M>1  when  it  is  noted  that  solution  of  linear  equations  (104)  must 
be  done  anew  for  each  different  value  of  p.  This  was  a  good  technique  for 
spectral  estimation  when  M=1  (see  Ref.  3);  however,  computer  time  was 
greater  than  for  the  Burg  technique.  Moreover,  stability  of  the  correlation 
recursion  (52)  is  unlikely  in  view  of  the  (occasionally  unstable)  results 
for  M=1  in  Ref.  3. 

This  technique  could  be  extended  to  include  backward  prediction  in 
addition  to  (101).  However,  the  lack  of  the  block  Toeplitz  property  and 
lack  of  stability  make  it  a  very  undesirable  technique. 
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The  technique  suggested  here  (in  subsection  3.1)  lets  the  correlation 
estimate  be  yielded  according  to  solution  (80),  once  partial  correlation 
coefficients  and  have  been  specified.  And  we  shall  see  in  subsection 
3.4  that  these  latter  quantities  are  determined  according  to  a  physically 
meaningful  minimization  problem.  Stability  of  the  correlation  recursion 
(52)  has  not  been  proved;  however,  numerous  examples  have  all  yielded  stable 
solutions.  The  estimate  (80)  is  unchanged  by  the  addition  of  more  stages, 
that  is,  larger  values  of  p.  And  it  will  be  seen  that  the  current  technique 
reduces  to  Burg’s  algorithm  (Ref.  4)  for  M=l.  Thus  the  current  tecr.riqje 
appears  to  be  very  attractive  among  those  techniques  that  employ  an  all -pole 
representation  of  the  input  process. 


3.3  PROPERTIES  AND  INTERPRETATIONS 


The  quantities  and  Vr,  were  defined  in  (95)  and  (89)  and  were 
interpreted  as  estimates  of  the  correlation  matrices  of  the  (p-l)th  order 
forward  and  backward  residuals,  respectively,  at  zero  time  delay.  It  is 
shown  in  appendix  H  that  they  satisfy  the  recurrence  relations 


Vlf-W  - 

V(r-<A;)Vr ,  V.-K  , 


(105) 


just  as  for  the  known  correlation  case,  It  is  also  shown  that 


if 


006) 


and 


<Mr  Up--  . 


(107) 
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However,  we  are  not  able  to  prove  or  nonnegative  definite  without 

ti  M 

specifying  the  method  by  which  and  are  selected;  no  relations  like 
(30)  and  (32)  exist  here. 


By  means  of  (106),  the  constraint  (98)  on  selection  of  and  takes 
the  form  (see  42)) 

(108) 


This  will  be  used  in  the  next  subsection. 


3.4  EVALUATION  OF  PARTIAL  CORRELATION  COEFFICIENTS 

We  recall  from  subsection  2.1  that,  in  the  known  correlation  case,  the 
partial  correlation  coefficients  ^  and  e|?  minimized 


(109) 


respectively,  when  lower  order  stages  had  already  been  optimized.  We  extend 
this  idea  to  the  unknown  correlation  case  as  follows:  let  (as  in  (18)  and 


(19)) 


.  X»  ,  ,  i*  MM, 


(110) 


and  for  I  ,  define  errors  (residuals) 

'C.'r-A'zr} 

sr-wj ' r 

The  block  diagram  for  (111)  Is  identical  to  that  in  figure  1  on  page  7. 


«  )c  *  N 


OH) 
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Define  for  p*l,  the  error  (residual)  matrix  over  the  available  data 


points  as 


E  »  —  ^ vV"  •  £> 

isi-f  K*p*»  K  *  r 


this  nonnegative  definite  matrix  is  an  unbiased  estimator  of  Y*  Yj^ 


Substitution  of  (111)  in  (112)  yields 


c-.  <:m/  j.  /(*<?*  a*h 


where 


.  _l_  jr  - 

JP-»  '  N-f  ufit  ‘a  '*  " 

*  -J _ yr)^,)H 

”  N-p  CT^i  Ik  ^-x-\  f 
<r-fr*)  B  _J _ 

*V'  5  NJ-f 


>-»  > 


"V 


(112) 


(113) 


( n  4  a  ) 


(1148: 


(114C) 


Also  define  for  pit  ,  error  matrix 


F  .  J-T  TPy*"  .  r" 

>  v-r  z  -  hr  • 


Substitution  of  (111)  in  (115)  yields 


(115) 


Fr  -  -Ef 5^  <+  . 

Now  error  matrices  Ep  and  Fp  are  Hermitian  and  nonnegative  definite. 
Therefore  matrix  A.E rA*  is  Hermitian  and  nonnegative  definite  for  any 
MxM  weighting  matrix  A4: 

YU  $  A!)y  *  (x yfc  (XY ?  o 


(116) 


(117) 
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for  any  Mxl  matrix  V*  Also  since 

4r(AEp-C)  =  -tr(^A,E;)  •  "MAEf),  die) 

only  the  product A«/£.A<1  matters  in  so  far  as  the  trace  of  is  concerned; 

notice  thatA  is  Hermitian  and  nonnegative  definite.  We  shall  be  interested 
in  minimizing  the  traces  of  weighted  error  matrices  and  Fy  ; 

the  exact  choice  of,  and  the  reason  for,  weightings  ^  and  £  will  be  under¬ 
taken  in  the  next  subsection. 

Now  if  we  were  to  minimize  by  choice  of  Ap  .  would  find 

(see  appendix  B  for  method)  that  we  must  solve 

_V,  a;  s£  -  Ar  ,  (1.9) 

and  the  choice  of  would  be  irrelevant.  Also,  if  we  were  to  minimize 
b*  choice  flf  B*  we  would  find  that  we  must  solve 


r  p  't'3* 

T"* vr^*  *7  P-'  >  (120) 

and  the  choice  of  would  be  irrelevant.  Furthermore,  we  would  not  satisfy 
constraint  (108)  generally.  But  since  the  behavior  of  error  matrix  Ff  is 
just  as  important  as  that  of  Ey,  we  should  take  both  matrices  into  account 
in  any  error  minimization;  in  fact,  for  known  correlation,  recall  that  the 
determinants  of  residual  matri  ces  \A  and  were  equal. 
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We  therefore  choose  to  minimize  the  sum  of  the  traces  of  the  weighted 


error  matrices 


Mv Er) + »  w(\,^rrFr)t 


(121) 


where  ^  and  are  Hermitian  and  nonnegative  definite, 
by  choice  of  and  ^subject  to  constraint  (108).  If  we  let 


(122) 


then  we  can  express 


A-  Fp  ■ 

v  R?-  s^v;:  ^rv,::  <]  o23> 

h-  rr,  [s“  -  $  ur::  -  s*  u,:  &r  *  &;u;  sj 

in  terms  of  the  single  unknown  matrix  .  Our  problem  therefore  is  to 
minimize  the  trace  of  (123)  by  choice  of  the  single  quantity  G  ,  subject 
to  no  constraints,  we  can  then  solve  for  the  best  coefficients  according  to 


QX  ,  *r  ’  Gr  W-' 


(124) 


Also  we  can  compute  the  correlation  estimate  from  (90)  and  (32)  according 


\ 


(125) 


In  appendix  I,  it  is  shown  that  the  minimum  of  the  trace  of  '123'.  is 
realized  when  G  is  the  solution  of  the  bilinear  -atri\  equation  '  9ef .  3' 
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where 


?  *  V*  vr:  sp  ur; 

r^r,: 

*>  -  a;,  ur:  s* 


(127) 


Uniqueness  of  the  solution  of  (126)  is  considered  in  subsection  3.6.  (It 
is  interesting  to  note  that  the  separate  minimizations  in  (119)  and  (120) 


yield 


Gcf<<-y  =  V-  6  =  O  • 


(128) 


Thus  whereas  both  these  quantities  had  to  be  equal  separately  to  the  zero 
matrix,  we  now  require  only  that  they  be  equal  to  each  other.) 


For  the  special  case  of  M=1  (a  univariate  process),  (105)  and  (108) 


yield 


Vv 


(m-  i ). 


(129) 


Then  (126)  and  (127)  can  be  solved  for  the  scalar 


(h*  »)■ 


(130) 
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Now,  if  and  only  if 

Pp^-Af.,  (m  *  *) ,  (i3i) 

(130)  reduces  to  Burg's  algorithm  (Ref,  4);  In  fact,  it  can  be  shown  that 

(131)  is  the  only  choice  of  weights  in  (130)  which  guarantees  a  stable 
correlation  recursion  for  M=l.  Thus  we  shall  insist  that  the  weights 
satisfy  (131)  when  we  deal  with  their  selection  below. 

3.5  WEIGHTING  OF  ERROR  MATRICES 

It  is  necessary  to  apply  weighting  to  error  matrices  Ep  and  Fp  in  (112) 
and  (115),  prior  to  minimization  of  the  trace  in  (121),  for  several  reasons. 
First,  without  weighting,  the  larger  amplitude  components  of  errors  (111) 
would  receive  most  of  the  emphasis  in  the  minimization;  thus,  some  weighting 
inversely  proportional  to  the  component  strengths  is  desired.  Second,  it 
is  desired  that  stable  correlation  recursions  result  and  that  matrices  Up 
and  Vp  be  nonnegative  definite.  Without  weighting,  it  has  been  discovered 
(by  an  example  to  be  presented  in  subsection  3.9)  that  both  of  these  require¬ 
ments  can  be  violated.  Third,  we  will  insist  that  the  scaling  property 
introduced  in  subsection  2.2  hold  for  the  unknown  correlation  case  as  well; 
that  is,  if 

\'T>\  ,  I  ,  ,  0  32) 

we  shall  insist  that  the  coefficients  satisfy 


a!’  d"1 
bit  - i>  ’ 


(133) 
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The  matrix  equation  (126)  can  be  combined  with  (122)  to  yield  the 
simultaneous  set  of  equations 

W-  &  -  n34) 

We  now  consider  several  possible  choices  of  weightings  .A^-i  and  fl_,  that  tend 
to  simplify  the  form  of  (134).  The  first  choice  is  no  weighting: 

=  X  .  Choice  I  (135) 

The  problem  with  this  choice  is  that  the  weighting  is  not  related  to  the 
error  component  strengths,  and  it  may  be  readily  verified  that  the  solutions 
to  (134)  and  (135)  do  not  satisfy  the  scaling  property  (133).  Also  an  unstable 
correlation  recursion  can  occur.  However,  the  solutions  do  reduce  to  Burg's 
algorithm  for  M=l;  see  (131). 


Our  next  candidate  weighting  is 

)  fp-i  ,  Choice  2 


which  are  Hermitian  and  are  nonnegative  definite  if  Up_-|  and  are 
nonnegative  definite.  This  weighting  is  inversely  proportional  to  the 
component  strengths,  as  desired;  more  will  be  said  on  this  below.  The 
equations  (134)  become 


f  y  -  y,  if  -  o . 


(137) 


The  solutions  of  (137)  satisfy  the  scaling  property  (133),  and  they  reduce 
to  Burg's  algorithm  for  M=l;  (129)  shows  that  (131)  is  satisfied  for  the 
choice  (136).  Although  stability  of  the  correlation  recursions  (52) 
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and  (55),  and  nonnegative  definiteness  of  Up  and  V  ,  have  not  been  proven 

'  r 

for  general  M*2,  no  counter  examples  have  been  discovered. 


We  next  consider 


jl ,  a. 


o ice  3 


(138) 


in  which  case  (134)  becomes 


(139) 


However,  the  weighting  (138)  is  not  necessarily  Hermitian,  is  not  necessarily 
nonnegative  definite,  and  is  not  directly  related  to  the  error  component 
strengths.  Also  the  solutions  of  (139)  do  not  satisfy  the  scaling  property. 
Furthermore,  the  solutions  do  not  reduce  to  Burg's  algorithm  for  M=l,  and 
can  yield  unstable  correlation  recursions  for  M=l. 


The  last  choice  is 


’  l’r-vr-£%. 


CUol 


Ce 


(140) 


which  are  Hermitian  and  nonneqative  definite,  and  for  which  (134)  becomes 

A?  V. +  4-  •  5£5*V  ^  , 

o. 

This  choice  is  a  very  interesting  one  in  that  the  solutions  of  (141)  are 
immediate  and  do  not  require  that  a  bilinear  matrix  equation  be  solved.  The 
weighting  (140)  is  inversely  proportional  to  the  error  component  strengths, 
and  the  solutions  of  (141)  do  satisfy  the  scaling  property.  In  fact,  this 
choice  is  very  close  to  Choice  2,  since  Up  ^  and  both  estimates  of 

the  correlation  matrix  of  process  Ml  at  zero  time  delay,  and  should  be 
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fairly  close  to  each  other,  However,  the  solutions  of  (141)  do  not  reduce 
to  Burg’s  algorithm  for  M=1 ,  and  the  correlation  recursion  (52)  can  be 
unstable,  even  for  M=1 .  In  fact,  the  solutions  to  (141)  are  identical 
to  those  for  Choice  3  for  M=l. 


Therefore,  of  the  four  choices  considered,  only  Choice  2  in  (136) 
yields  solutions  that  satisfy  the  scaling  property  (133)  and  reduces  to 
Burg's  algorithm  for  M=1 .  The  stability  of  the  correlation  recursions  has 
not  been  proved  or  disproved  for  choice  (136)  of  weighting. 


There  is  another  strong  reason  for  choosing  weighting  (136),  which  has 
to  do  with  a  whitening  interpretation.  We  recall  that  Up-,  and  y  f 
defined  in  (95)  and  (89),  are  estimates  of  the  correlation  matrices  of 
processes  ['T*’0}  and  *  respectively,  at  zero  time  delay.  Now  let 

(for  non-negative  definite  Up_,  and  Vf-,  ) 


(142) 


where  Up., 
processes 


and  Vp_,  are  (lower  triangular)  square  root  matrices.  Then  scaled 


2-K  1  %-»  s  ,  f  *  Ki  M , 


(143) 


each  have  estimated  correlation  matrices  at  zero  time  delay  equal  to  I;  that 
is,  all  the  components  of  f^’0]  (or  )  have  unit  power  and  are 

uncorrelated  with  each  other  at  zero  time  delay. 


Now  define,  for  p+ii)c*sN, 


f  ’  *£  -  u;:  (\r-  hX')  -  X-  K  2T , 


(144) 
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where 


(145) 


Also  define  the  estimated  correlation  matrix  at  zero  time  delay  of  process 


-L-± 


(146) 


where  we  have  used  (144)  and  (112).  Therefore 

([£$),  (147) 

“I 

where  we  have  used  (1-1)  and  (142).  Thus,  minimizing  the  trace  of  Up.,  Ep, 
by  choice  of  A^*,is  equivalent  to  minimizing  the  trace  of  ^  by  choice  of 
(see  (144)),  where  process  ly^jis  the  error  in  prediction  of  (p-l)th 
order  processes  with  estimated  correlation  matrices  at  zero  time  delay  equal 
to  I. 


In  a  similar  fashion,  for  pHiks  N, 


where 


(148) 


(149) 


And 


\  t-M.  v;/>  V/; 


(150) 


wi  th 


W?r  -  "tr  ( VpT,'  Fr). 


151) 
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If  we  solve  (145)  and  (149)  for  and  »  and  then  utilize  constraint 
(108)  along  with  (142),  we  find  that  the  constraint  takes  the  form 

ft]  -  .  on) 

This  could  be  used  as  the  starting  point  in  a  minimization  of  error  matrices 
and  .  In  fact,  if  we  minimize  the  unweighted  trace  off^V^  by  choice 
of  ,  we  find  the  optimum  choice  to  be  given  by 

=  2  ,  053) 

where  the  notation  is  an  obvious  modification  of  (114).  By  employing  (145), 
(143),  and  (142),  we  can  show  that  (153)  is  equivalent  to  (137),  as  it  must 
be.  (This  alternative  approach  may  be  useful  for  proving  the  stability  of 
the  correlation  recursion.) 

3.6  SOLUTION  OF  BILINEAR  MATRIX  EQUATION 

If  we  substitute  definitions  (127)  into  bilinear  matrix  equation  (126), 
and  premultiply  by  and  postmultiply  by  ,  we  obtain  the  equation 

Gj,  %  +  a  Qr  =  p  f  (is-*) 

where 

*  ■  FV  r* 

()55) 

p'Ks?  v,:.  r* 

p.-£ipj£r,5  . 
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the  Hermitian  matrices  2  and  p  are  non-negative  definite;  e.g., 

1A  V  *  (v;;  rrV/^(vf:  rr  »  *  0 


(156) 


for  any  MX1  matrix  °J  ,  since  is  non-negative  definite.  We  have  employed 
the  Hermitian  property  of  V^_t  and  above;  see  (118)  et  seq.  This  means 
that  the  eigenvalues  of  «<  and  p?  must  be  non-negative.  Therefore  the 
solution  of  (154)  exists  and  is  unique  (Ref.  10,  eq.  3). 


Solution  of  the  bilinear  matrix  equation  (126)  or  (154)  has  been  addressed 
by  many  authors  (Refs  9  -  17)-.  In  particular,  for  the  equation  involving  MxM 


matrices , 


XB+ AX  =  C, 


(157) 


one  form  for  the  solution  is  given  by 


X  =  FQ~  , 

cr, 

are  MxM  matrices.  The  constants  fa*}  are  given  by  (Ref.  18,  pp.  87-88) 


-  .  -L 


and  the  matrices  ]f\  }  are  given  by 


Ah  » A  Am  +  a,  r  ub 


(4-r) 


(158) 


(159) 


(a,  =  l)  f  (160) 


(161) 


Here,  M-2  full  matrix  multiplications  are  necessary  when  we  note  that  AM  =  0 
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by  the  Cayley-Hamilton  theorem. 


For  M  *  2,  (159)  takes  the  form 


T  *  C®  -  (A--Wt)l)  c 
<3  =(U+4-8)Bi-(*M-*+$r 


fcr  M*  2  f 


(162) 


where  we  have  used  the  Cayley-Hamilton  theorem  to  express 


BJ  -  i>(P)  l  -  det(B)l  h  =  2 


(163) 


Equations  (162)  and  (158)  are  the  forms  used  in  the  FORTRAN  program  for  M  =  2. 
3.7  SPECTRAL  ESTIMATION 

Having  obtained  correlation  estimates  by  means  of  (82)  and  (80A), 
we  now  extrapolate  these,  as  in  subsection  2.3  (equations  (52)  and  (66)),  to 


yield 


K  =  X  A?  R. 


"  ZT]  Km-«  y  />+l  £  W, 

m«o.  (164> 

This  defines  an  infinite  sequence  which  is  assumed  stable;  its  Fourier 

transform  will  be  taken  as  the  spectral  estimate  of  the  process  under  consid¬ 
eration.  In  a  manner  identical  to  that  given  in  appendix  F,  it  is  found  that 

=  HrW  ,  IfKi'  (165> 

where  0^  and  H^f)  are  given  by  (95)  and  (68),  respectively.  It  follows  that 

^  Gr  *  Sa*fk  (to).  (166) 


(166) 


Also,  as  in  subsection  2.4,  an  alternative  factorization  is  available  as 


^(t)*  a Hr(f)  vf  ^(f)'  ,  Ifl'i ' 


(167) 
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where  and  H^(f)  are  given  in  (89)  and  (70).  If  or  is  non-negative 
definite,  then  Cr^ff)  is  non-negative  definite,  as  desired  for  a  spectral 
estimate.  Since  (165)  and  (167)  are  equal,  we  concentrate  henceforth  on 


form  (165) . 

Since 


(168) 


(165)  can  be  expressed 

<^(f)  -  a\  det  |  \\[M,  <169> 

Since  Grfyis  Hermitian,  matrix  £r(?>(f)  need  be  computed  only  on  and  above  its 
main  diagonal,  at  each  frequency  of  interest.  Efficient  computation  of 
H^(f)  by  means  of  an  FFT  is  undertaken  in  appendix  J.  It  is  shown  that  we 
need  to  perform  M*  ty-point  FFTs  of  p+1  non-zero  numbers,  in  order  to  evaluate 
H^tf)  at  Np  frequency  cells  in  the  frequency  range  ^  >  ^)  ■ 


Real  Multivariate  Process 

The  results  above  have  been  derived  for  a  complex  multivariate  process 
[Xm).  Eor  a  real  multivariate  process,  ^  is  real  and  are  real  •  "I’hen 

for  a  real  process,  ( 1 7 £ 


*  *  Wq.  W’" 


for 


a  real  process. 


(171) 
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Thus  we  need  compute  matrix  only  for  f*b,  for  a  real  multivariate 


process . 


In  order  to  avoid  complex  matrix  multiplications,  we  develop  (169)  more 


explicitly;  let 


(172) 


where  ^(f)  and  T^(f)  are  real  MxM  matrices  at  each  f .  Then  since  \J^  is  real 
=  VJy,  .  and  upon  substituting  (172)  in  (169),  we  find 


where 


<?$  -•‘I'WtfGf  <  Mf)-i  wf/] 

0  rial  yv\ctn ; 


(173) 


M|f)*Ilf)W)T 


(174) 


Since  M(f)  is  real,  the  quantity  (M(f)-i  M(f)T  is  zero  on  the  main  diagonal; 
therefore  we  need  not  compute  the  main  diagonal  of  M(f).  All  the  matrix 
multiplications  in  (173)  are  real. 


Real  Bivariate  Process 

We  now  further  specialize  to  M  =  2,  a  bivariate  process.  Let  the  real 
and  imaginary  parts  of  the  filter  transfer  function  H ^  he  denoted  by  XX 
and  YY,  respectively  (where  these  symbols  are  unrelated  to  X  and  Y  introduced 
earlier);  that  is 

Xx  to + 1  rr(-f ) .  <i75) 

Then  from  (172),  for  2  X  2  matrices, 

\d)  H  fefofW] *  H  xxf) .  x^ff),  <n.) 
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and 

A$  YYtf)*  YXfO-  (177) 

Substitution  of  (176)  and  (177)  in  (173)  yields  spectral  estimate 

-  «\«  #)f[x)i(fX|  ty(f  f+n  (f)  Ur^ff'T  * 1  Mff)  -  •:  AiJf'p 

-fcr-  fli  y^oj  pivQtlcch  prvcess;  ^ 78 ' 

where 

M(f)  *  YX(f)  X^(f)r 

The  2X2  matrices  involved  in  (178)  are  all  real,  and  XX^(f)  and  Yv , ' * )  are 

A 

the  adjoints  of  the  real  and  imaginary  parts  of  H*r>ff)  ,  respective '.y .  Tne 
form  (178)  is  used  in  the  program  for  the  spectral  estimate  of  a  real 
bivariate  process. 

3.8  TERMINATION  PROCEDURE 

For  unknown  correlation,  the  correct  value  of  p  to  use  in  (79)  arc  (30) 
is  unknown.  We  adopt  the  Akaike  information  criterion  ( A I C )  derived  in 
Ref.  19,  page  719: 

AIC lii  2 f 

•  N  In  M  Vf  f  2  ,32' 

where  we  have  utilized  (107);  namely,  we  compute  AICn  for  p  =  0,  l...,p 

w  max 

and  we  use  that  value  of  p,  pL  ,  for  which  AIC  is  a  minimum.  Selection 

best  p 

of  p  is  discussed  below, 
max 

For  purposes  of  updating  Up  and  V  ,  we  can  combine  (105),  (106),  and 
(122)  to  yield 


42 


TR  5501 


Up  *  Ur,-Ap  G>  ,  Vp  =  V^-^r  G>  ,  del) 

in  terms  of  the  solution,  Gp,  of  bilinear  matrix  equation  (126). 

At  this  point,  it  is  worthwhile  to  review  the  procedure  adopted  here. 

From  the  actual  data,  we  could  have  estimated  the  input  correlation  matrix 
via  (99)  (or  some  scaled  version  of  it).  Also  we  could  have  used  (112)  and 
(115)  as  error  matrix  estimates;  in  fact,  these  matrices  are  guaranteed 
Hermitian  and  non-negative  definite.  However,  since  <iet  we  would 

have  had  to  settle  on  some  average  like 

I"  (  det  E?  •  W  fp  *  ( In  dW  Ey  +  U  <fct  (182) 

for  purposes  of  the  information  criterion.  As  for  the  spectral  estimate,  we 
could  have  adopted,  instead  of  (165),  the  quantity  H^ffT*  , 

orAWp^)  t  f0r  example. 

Instead,  we  have  chosen  consistently  to  stick  with  the  results  of  the 

normal  equations  (78).  Thus  the  estimate  of  the  input  correlation  matrix  is 

obtained  from  (80)(and  (82));  the  estimates  of  the  correlation  matrices  of  the 

residuals  are  given  by  (89)  and  (95)  (or  more  computationally  convenient  via 

(181));  and  the  spectral  estimate  is  given  in  terms  of  or  by  (165)  or 

(167),  respectively,  for  p*p.  ..  The  major  gap  in  this  procedure  is  that 

best 

we  have  not  proved  that  Up  or  Is  non-negative  definite  for  Choice  2  of 
weighting  in  (136);  however,  no  counter  examples  have  been  discovered. 

Our  selection  of  Pmax  is  accomplished  as  follows:  in  ref.  1,  page  575, 

% 

Akaike  is  quoted  as  suggesting  j^«3pn  for  M  ■  1,  a  univariate  process.  Since 
the  number  of  coefficients  evaluated  is  p,  and  the  number  of  available  data 
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(183) 


points  Is  N,  this  ratio  was  upper  bounded  by  3/V  .  We  extend  this  idea 
directly  to  the  multivariate  case:  the  number  of  scalar  coefficients 
evaluated  is  .  and  the  number  of  available  scalar  data  points  is  MN. 
Upper  bounding  this  ratio  by  3N  »  we  find  we  should  choose  the  filter 
order 

in  terms  of  the  number  of  data  points,  N,  and  the  dimensionality  of  the 
time  series,  M. 


3.9  EXAMPLES 

It  is  worthwhile  to  summarize  here  the  sequence  of  calculations  required. 

For  data  X^ ,  X2,..,  X^  available  (with  the  sample  mean  removed),  we  have 

M 


>?•£■  x., 

5 

vJ0  - v.  -  "X.  -  Nr^x»x"lU-"* 


\L 


184) 


Then  forp>i  and  choice  (136)  of  weighting. 


<  •  v,:;  5^ 

f  *  ^  Mr' 

Grf  v*  (uw) 


W’~B  fQf 
AXC^.N  UcUv^t  2M> 


185) 
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,  Vf"’)-  A(p t*) 

*K  Ik  ?  +  |ik$W 

5^.-1—  ± 'if  vf  .  5f" 

r  N-p-l  «»f«  15  *  P 

5*.  _!_  jr  £’  .  -i_  t!  2^2”" , 

‘•'p  N-p-l  Ai-r-l  **  r 


d¥«  _l _  5"  y**2#H 

JP  *  M_f- I  k.^,  'K  C*-<  * 


086) 


(187) 


For  p  =  pmax’  11  is  not  necessary  t0  compute  (186)  through  (187).  When  the 

best  value  of  p,  p  ,is  found  from  AIC  ,  we  can  then  compute  the  spectral 
best  p 

estimate  (165). 


We  now  consider  an  example  for  M  =  2,  N  =  4: 


V-Vo-K 

Then  for  weighting  (136),  we  find 


*-M- *•[:)•  *■[:;]  ,*■[:], 

v.-Vo-K*p'0  hi] 

f  i  nrl  L  -J 


(188) 


r  ,  r»»  '*)  «*>  X.\  '  3 

2,J  «L<  ' 


(189) 


The  eigenvalues  of  are  (-3t/^*)/i2  ,  which  are  both  bounded  by  1  in  magnitude, 
as  they  must  be  for  the  correlation  recursion  (164)  for  p  =  1  to  be  stable. 


Also, 


w, 


_  A1 

72  6  16  > 

- 


(190) 
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which  is  non-negative  definite.  Thus,  for  weighting  (136),  all  the 
desirable  properties  are  realized. 


However,  for  no  weighting,  (135),  we  find,  for  the  same  example  (188), 


20  30 
30 


-10  0 
-2.1  . 


(191) 


The  eigenvalues  of  are  4/9  and  -10/9;  since  the  latter  is  larger  than  1 


in  magnitude,  the  recursion  8m  =  A?  Rm.(  »  w2l,  is  unstable.  Also 


«■- v.  ■-& 


"1 


39  5*7 

5"7  5->J  , 

which  is  not  a  non-negative  definite  matrix.  It  is  found  that  the  spectral 
estimate  obtained  from  (165)  has  frequency  rangeswhere  the  two  autospectra 
(diagonal  terms  of  (165))  are  negative,  and  where  the  magnitude-squared 
coherence  can  be  negative  or  greater  than  1.  These  are  all  unacceptable. 


(192) 


For  the  alternative  example  for  M  =  2,  N  =  4,  of 

X, 


r  -.1*1 

- 

-1.15 

'VI 

ii 

?l 

,  V 

?>j  >  v 

-* 

L  - 

„  - 

(193) 


and  no  weighting,  we  find  a  stable  correlation  recursion,  but  U-j  and  V,  are 
not  non-negative  definite,  and  values  of  the  magni tude-squared  coherence 
greater  than  1  are  realized  in  some  frequency  ranges.  3ecause  of  these 
unacceptable  behaviors,  the  choice  of  no  weighting,  (135),  is  discarded  from 
future  consideration. 


An  example  for  M  =  2,  N  =  100,  and  weighting  (136),  generated  via  (71)  - 

(73)  of  subsection  2.5  yielded  the  results  below;  the  program  and  its  output 

are  given  in  appendix  K.  We  find  p  =1  and 

best 
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At- 


.*7/5 1 

-11024 

.566/3 

.77098- 

.63432 

•  5603SJ 

''  1 

-63*2 

.  8(573 

(194) 


U,  *  .09110 


v?m  -  ook"7 

■corn  i.  02  36-f 


V  =.o9uo 


.3  util 

[  sun  i.ioonj . 


(195) 


It  is  worthwhile  to  compare  these  estimates  for  N  =  100  with  the  exact  values 
in  (76)  and  (77).  The  scale  factor  .09110  in  (195)  is  unimportant  and  is  due 
to  the  fact  that  the  white  noise  used  here  had  variance  1/12  rather  than  1 
as  in  (73);  except  for  the  scale  factor,  the  matrices  in  (195)  have 
determinants  equal  to  1.  The  estimated  magnitude-squared  coherence  reaches 
a  maximum  of  .999745,  versus  the  true  peak  of  .999013. 

Observations  from  other  examples  of  real  bivariate  processes  have  pointed 
out  that:  the  eigenvalues  of  A,  ana  8*  are  identical  and  are  bounded 
by  1  in  magnitude;  the  eigenvalues  of  and  ^  are  not  identical  for 
p>2  »  and  can  be  larger  than  1  in  magnitude;  and  the  eigenvalues  of  A„ 
and  bIT  for  n<p  can  be  larger  than  1  in  magnitude. 

Timing  Results 

Some  sample  execution  times  on  a  UNIVAC  1108  for  SUBROUTINE  PCC, 
which  evaluates  the  partial  correlation  coefficients,  are  presented  below 
for  M  =  2,  a  bivariate  real  process. 
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Table  1.  Timing  of  Subroutine  PCC 


N 

pmax 

Time  of  Execution  (sec) 

| 

100 

10 

0,25 

100 

15 

0.35 

1000 

10 

2.63 

1000 

40 

9.23 

10000 

50 

120  , 

10000 

150 

326  ! 

The  execution  time  is  almost  linearly  proportional  to  N  and  P!T.ax.  The 
execution  time  for  PEFTF  was  1.25  seconds,  and  that  for  SCM  was  0.55 
seconds,  both  for  Np  =  1024  frequency  cells;  see  appendix  \  *cr  orpgram. 

4.  SUMMARY 

A  method  for  multivariate  linear  predictive  spectral  analysis, 
employing  weighted  forward  and  backward  averaging,  has  been  presemec  and 
programmed  in  FORTRAN.  The  method  consti tutes  a  generalization  of  Burg's 
univariate  algorithm  (Ref.  4)  to  the  multivariate  case. 

The  choice  of  weighting  in  the  error  minimization  is  very  important, 
and  several  candidates  have  been  considered.  The  weighting  retained,  (136), 
is  the  only  one  of  those  considered  that  satisfies  both  the  scaling  property 
(133)  for  all  M,  and  reduces  to  Burg's  algorithm  for  M  =  1.  Also,  the 
weighting  retained  is  equivalent  to  minimizing  the  unweighted  traces  of 
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error  processes  that  are  the  differences  of  approximately  white  processes; 
in  fact,  (136)  could  be  used  as  the  starting  point  of  the  error  minimization. 

The  major  gaps  in  the  analysis  are  that  we  have  not  proved  that  Up  and 
Vp  are  non-negative  definite,  and  we  have  not  proved  that  correlation 
recursion  (164)  is  stable;  however,  no  counterexamples  have  been  encountered. 
The  major  analytical  block  in  this  endeavor  is  the  bilinear  matrix  equation, 
(126),  which  requires  special  treatment  for  its  solution. 


49/50 
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Appendix  A 


PROPERTIES  OF  A  SPECTRAL  DENSITY  MATRIX 


Suppose  an  arbitrary  linear  filter  with  impulse  response  {Hn>  is 
excited  by  input  {X^} .  The  output  at  time  kA  is 

V-2XX*,.  <A 

where  the  sum  is  over  all  non-zero  summands.  and  Yk  are  M  x  1  matrices, 
whereas  Hn  is  M  x  M.  In  steady  state,  the  spectra  of  the  processes  in  (A-l) 


(A-l) 


are  related  by 


&TW  • 


(A-2) 


wnere  transfer  function 


HW  *  -21  exj> (-,* ZttUa)  } 


and  f  frequency  in  Hz  and  is  real. 
Now 


(A-3) 


GrJP)M  “  a  2Ltxp(i  »  4  2Lexp(~i2trf»A)Rn  •  6r„(f);  (A-4) 

where  we  have  employed  (2).  Thus  ( f )  is  Hermitian  at  any  value  of  f. 
Similarly  Gy(f)  is  Hermitian  at  any  f. 


K?  •  X  y; 


(A-5) 


is  non-negative  definite  for  any  H(f),  because 


fX'fy .  n/H  x  y„h  i  =  \y\  r  *  o  <*-<) 


A-l 
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for  any  M  x  1  column  matrix^  .  Therefore 

-  P 

,  [jf  Gytf)  =  ]  <#  H(f)  (A-7) 

is  non-negative  definite  for  any  H(f).  It  then  follows  that 

(-f)  is  mow- *e<jsrive  def/V'fce  -for  a!!  ■£,  'A-8) 

To  prove  this,  assume  that  Gx(f-,)  is  not  non-negative  definite;  than  '■  *'  we 
choose  H(f ) "  I  $(f-fi),  that  is,  an  impulsive  transfer  funct'cr  nea 

frequency  f -j ,  we  get  G  )  from  .»r, ich  contradicts  the  ct"  s i or. 

ry> 

that  Rq  must  be  non-negative  defin:te. 

Thus  a  spectral  density  matrix  must  always  be  Hermit! an  an:  np^-f.egat' -a 
definite  for  all  f.  Ir.  particular,  this  implies  that  all  the  auto  react-' 
(diagonal  terms  of  the  matrix'  must  oe  real  and  nor-rer.uive.  T :  a .  s : 
that  all  coherences  are  boundea  by  um'tv  ir.  magnitude. 
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Appendix  B 

MINIMIZATION  OF  TRACE  OF  ERROR  MATRIX 


From  (4)  and  (5),  we  have 

K  "  I 


where 


Let 


0  *  [A,  , 

XX' 


XX  -  £• 


(B-l) 


(B-2) 


(B-3) 


Here,^  is  M  x  Mp,  \  is  Mpxl,  £.  is  MxMp,  and  0  is  MpxMp.  We  notice 
that  QH  s  Q,  and  >0  for  any  Mpxl  matrix  t'Vo,  if  no  exact 

linear  relation  exists  between  the  elements  of  X„_,  »  that  is» 

9  is  Hermitian  and  positive  definite. 


YYm  = 

‘VaeH-eaH*asQH 

^-ea"cY(a-co'’)Q(a-co'Y 


where  V  is  an  Mpxl  matrix.  Then  for  the  M  x  M  matrix  in  (B-5), 

J 


(B-4) 

(B-5) 

(B-6) 
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Jj)w=i  > 

where  complex  scalar 

4  =  ^hqu.  (3 

The  real  quanti  ty  for  any^fO  ,  since  Q  is  Hermitian  ana  oositive 

aefinite;  the  minimum  value  of  X-  is  zero  and  is  attained  i f  ana  only  if 
^/.  «.  o  .  Therefore,  tr  L  is  minimized,  attaining  value  zero,  by  the  z.noice 
c\j.  t  o  I  4 j  s  M  Thus  tr  Y„  Y*M  YK  ' $  minimi  zed  by  the  cnoice  of  d  as 

J 

.  [{-.  fl  ■  i<£\ 

since  the  leading  two  terms  in  (8-5)  are  independent  of  CL. 


Then  we  have  opt  L  =  0  and 

°pt  xxSv CQ" '(?h  -  a-  o¥ch  ■  K0 -  4 Q (£ 

A1  SO 


win 


-  *  oft'- xx1 .  tr(vec"c 


")  =  xX-v(: 


(3-10) 


'3-11' 


It  should  be  noted  that  the  solution  (9-9)  is  attainable  directly  from 
(3-4)  if  the  coefficient  of  d  (or#)  is  set  equal  to  zero;  this  observation 
will  be  useful  later. 


Equations  (B-9)  and  (8-10)  can  be  developed  as  follows: 


V=C 


yields,  with  the  use  of  (B-2)  and  (8-3), 


K*  <] 


R,  V  *r? 

^  »-[V  Kr], 


K. 


R.J 


( B  - 1  2 ) 


8-2 
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that  is. 


n* i 


And  (B-10)  can  be  expressed  as 

:  *  *  -  £ *'X 

k\ 

Equations  (B-13)  and  (B-14)  are  the  main  results  of  this  appendix, 


(B-13) 


(B-14) 


If  an  exact  linear  relation  exists  between  the  elements  of  X*.,  r-  ,X,-f  > 


^  S0m * 


*  ° 


(B-15) 


In  this  case,  (B-l)  yields 


X-i  *  X<-i  "  ^  A*  X-i-i  *  X<-»  ~  jfj  V'  ^-j  “  \  '  (b-16) 


Therefore  we  can  get  zero  error  by  choosing 


<*•-« ,  i*"*  p-i 


n*»  '  ]  '  (B-17) 

Thus  if  an  exact  linear  relation  exists  between  the  elements  of 
^*-i  Xup  • 

Also  we  have  the  following  general  theorem: 

VM 

No  exact  linear  relation  ^  v  •  *  lis  positive  definite.  (B-18) 

between  elements  of  X*,.-,  ^  ^  lo  -  1 


To  prove  this,  let 


%  \ 


D  *  ; 


(B-19) 
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>4 

Then  F  s  t>  is  a  scalar.  Now  if  and  only  if  an  exact  linear  relation 

exists,  F„  =  0  for  some  D  *  0,  no  matter  which  member  function  of  the 
ensemble  we  select  (with  probability  one).  We  also  notice  that 

Wf  *  (B-20) 

and  that  the  ensemoie  average  in  (B-20)  is  equal  to  the  matrix  in  (3-18). 


Assume  that  rK  *  0  -'or  any  3*0.  Then  :>o  F°r  drV  3  *  0,  and 
the  right-hand  side  of  (3-20)  is  positive  for  any  D  V  Q.  Therefore 
xx:  is  posi tive  def ini te . 


Conversely  if  xx  ^  positive  definite,  the  right-hand  side  of 
(B-20)  is  positive  for  any  3*0.  Then  |£|*  >0  *or  any  3*0,  yielding 
F„  *  0  for  any  D  *  0. 


B-4 
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Appendix  C 

INTERRELATIONSHIPS  OF  Un  AND  V 

P  p 

We  start  with  the  definition  (12)  and  develop  Up  as 


(by  (15)) 

a,  +  ATiLBf’R. , 

(by  (12)) 

A\,h  -  aT  Dr 

(by  (13)) 

-  aI’bJ’u,.. 

(by  (14)) 

(T-  A'BDU,., 

This  relation  holds  for  p>1,  with  Rfc.  A  similar  derivation  for  V. 


yields 


v,-( ,/>*!;  v.-  V 


(C-2) 


The  determinant  of  Up  is  given  by 

MU,.  *Kl-W  WU^ 


whereas  the  determinant  of  V„  is 

P 


•  MVr 


(C-3) 


(C-4) 


c-i 


TR  5501 


Now  if  det  Up_1  *  det  Vp_1>  then  (C-3)  and  (C-4)  indicate  that 

det  =  det  . 

But  since  U0  =  V0  =  R0  ,  det  U*  =  det  V# .  Therefore  (C-5)  hc'cs  *'?' 
piO,  by  inauction. 


C-2 


in 
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Appendix  D 

HERMITIAN  PROPERTY  OF  EXTRAPOLATED  CORRELATIONS 


Me  know  that 


ft*  *  ftw  fcr  I M  5  f 

We  then  solve 

£aSX..  K. ,  i*k*p 

i 


for  m,  ,  and  set 

ftjf  *  |k|s  f 

*  w  •  I 


We  then  define 


"R  ^  .  ft1*  W  p+i  =  k 


In  a  similar  fashion  for  the  backward  case,  we  solve 


-U,  ,  i»k*r 

NS  I 

for  Wl,f  • and  set 


L  iHkp. 


We  then  define 


4  k. 


(D-l ) 


(D-2) 


.  (D-3) 


(D-4) 


(0-5) 


(D-6) 


(0-7) 
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We  know  from  the  definitions  above  that 


=R‘r>  f»r  |kl*J»- 


Now  we  assume  that 


>  “R(r)  -f<*-  \k\<  wW«  ^ 

— K  ** 


that  is,  from  (D-6)  and  (0-3), 


i 


•-M 


ii 

£-> 

W*  ) 

f i ni t i on  ( 0-6 ; , 

eL’1" 

W-  " 

to-*). 

S'” 

«»  t  J=  1  4 

|  dj  D-j'1 

,df 

j»i  J.T7  •■'—■j)  " 

!i-  >1)) 

■w 

i,  a^k1'’ 

(I>~ 

j.,  j 

‘  -/ 

4y  (M‘ 

t»r  •  -  k  i 


"-O' 


"nerefore  we  have  extended  vD-9)  by  one  step,  arc  t 
^*0+1  by  induction. 


r  „  -  r 
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Appendix  E 


RELATIONSHIP  OF  DETERMINANTS 


The  forward  correlation  recursion  is  given  in  (52)  as 


vlr)  -  '4"  if) 


The  z-transform  of  this  sequence  is 


<f> 


^  =  ±?ti* 

*»i 

The  inner  sum  on  m  can  be  expressed  as  (see  (53)) 

£?-<*X+ 


Therefore , 


«(*) 


w»  t  n»  l 


or 


**  i  '  t 


(E-l) 


(E-2) 


(E-3) 


(E-4) 


(E-5) 


At  the  same  time,  we  define  the  z-transform  of  the  backward  correlation 
recursion  as 


&*)* 

K.pt-t 


(E-6) 


and  note  that,  via  (62), 


fw- 

■iH 


(E-7) 
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A  comment  on  notation  is  timely  here.  If  matrix 

G-fe)  *  ri)„  , 

where  i  is  a  complex  scalar  variable,  then 


(E-8) 


(E-9) 


But 


&wH  -  2(4" t>:  , 

which  is  not  always  equal  to  'E-9),  unless  2  is  real. 


( E  - 1 0 ) 


But  let  us  also  develop  definition  (E-7)  by  means  of  backward  recursion 
' 55),  in  a  manner  similar  to  that  above  in  (E-l)  through  (E-5).  Ae  find 

±ri?\  b!,h  (h  to) 


+  -  i 


V<*>*  p+ 


;e-u  ) 


”ne  inner  sum  on  m 

1  s 

0-*-»  ,  v 

f  l 

i 

<=o 

-  ?  " 

nu 

where  we  used  ,  56  <  , 

'  n  \ 

'  <-  j  > 

(E-3),  and 

or 


v  '  v  '  I  V*-  i 


IE-13 


v  -  * 
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Combining  (E-5)  and  (E-14)  according  to  (E-7),  we  see  that 

ckt(x-^*"AT)  (e-'5) 

must  have  the  same  zeros,  since  these  two  quantities  determine  the  singul¬ 
arity  locations  of  (E-5)  and  (E-14).  The  quantity  <Rn(z)  defined  in  (E-3) 
is  singular  only  at  z  =  0. 


Furthermore 

and 


(E-16) 


(E-17) 


where  we  have  utilized  the  observations  that  the  quantities  in  (E-15)  have 
the  same  zeros,  the  same  pole  at  z=0,  and  the  same  scale  factor.  Therefore 
the  two  determinants  in  (E-15)  are  equal. 


Also  since 

d?l(X-^2'"6rJ)-  I  -  4vGr,  —  +(-  (T ,  (E-18) 

it  follows  that 

tr/f-VB r.(tr^)* 

and 

«WA (E-a» 


E-3 
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Numerical  examples  show  that  generally 

\^r$\  *  4-  i<k£f 


(E-21) 


and 


( E-22 ) 
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Appendix  F 


SPECTRUM  FROM  EXTRAPOLATED  CORRELATIONS 


The  forward  correlation  recursion  is  given  by 

tr)  -  r1  > 


(F-l) 


where 


t'I’  -  ,  h  =  ? 

and 

.  t!fH  ,  pu  p+i. 

-in  w  /  / 

We  wish  to  evaluate  the  z-transform  of  UVS)  •• 

^w-^4 


(F-2) 


(F-3) 


(F-4) 


In  order  to  do  so,  consider  a  fictitious  process  [Xn]  with  the  corre¬ 
lation  given  by  (F-l)  through  (F-3).  Consider  the  output  of  the  optimum 


predictive  error  filter,  given  by 


Y.  .1  *. 


(F-5) 


The  crosscorrelation 


c. .  x  x.: .  -  ±A* <f-6> 


Using  (7)  and  (F-l),  we  see  that 


(2^=0  "fir  \  t  115  o]  ) 


(F-7) 


F-l 
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that  is,  predictive  error  filter  output  Y  is  uncorrelated  with  all  pas' 

A 

values  of  input  • 

Also,  output  autocorrelation 


T>_  *  X  X 


It-"  W*e 


if-s; 


using  (F-5)  and  (F-6).  But  now  employment  of  (F-7)  in  (F-8)  shows  that 


3X-°  *i!' 

Also  (F-7),  (F-6),  (F-2) ,  and  (12)  yield 


=  Ue 


o  w  n*o 

And  since,  from  definition  (F-8), 


"  -o 


v  F  -  9 1 


we  have 


*T>„  , 

-m  **’  ? 

*».•  [u-r°  ]  ; 


(f-i; 


A  A 

that  is,  predictive  error  filter  output  Yk  is  white  for  input  Xk. 
course,  Up  is  not  diagonal). 


(Of 


At  the  same  time,  autocorrelation  Dm  can  be  expressed  (by  means  of 


(F-5))  as 


All  M  . 


(F-13) 


Therefore  the  z-transform  of  D„"\  is 


F-2 
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±ri>.  *  ti'fS a 

k>.«  *1*0  "M**  J 


fe") , 

where  we  have  used  (F-13),  (23),  (F-4),  and  (E-9).  When  we  couple  (F-14) 
with  (F-12) ,  we  obtain 


(F-14) 


(F-15) 


or 


J%)  -  ^^fV)]"; 


(F-16) 


where  matrix  Up  is  independent  of  2.  This  is  one  of  the  main  results  of 
this  appendix. 


If  we  let  (for  f  real ) 

?  *  expO^-Trfa)  ,  If)  <  2l 


(F-17) 


and  denote  the  forward  predictive  error  filter  transfer  function  and  spectral 

respectively,  then  the  spectrum  of  process  {X*"j  can  be  expressed  as 


<?’fr)  -  -  W  UP  $(f)"  , 


(F-19) 


where  we  have  utilized  the  result  that  (see  ( E-8) )  through  ( E- 1 0) ) 


[$?  =  ^(f)H  =  • 


(F-20) 


F-3 
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The  procedure  for  the  backward  correlation  recursion  parallels  that 


above  to  yield  (using  (23)) 


V,6CV)] 


^ fc\  _  .  LlPf r\H  \  /  \  rr\',H 


GTft  - 


V,  W 


ZERO  LOCATIONS  OF  (9) 


Assume  that  ^(^(9)  *  K  Q(z)  has  a  zero  at  z=z,  *  0;  that  is, 


(F-21) 


(F-22) 


Ossumc  QC?,)  =  0,  W*  *evo  mair ix, 


(F-23) 


where  0  <  |*il  .  But 


=  -  £?£  *  - »A^,+.-+  iT Af-  ?p  1] .  ( f-i 


Therefore  is  finite  for  £><1^,1,  yielding 


Qt*»)  =  0 


*  x. 


Therefore  assumption  (F-23)  is  invalid,  indicating  that 


Q(?)  ^  C  for  0  <  1*1  . 


Now  from  ( F - 2 4 ) 


(F-25) 


(F-26) 


F  -4 
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therefore. 


QU)-  -?f  4^  °- 


(F-28) 


Thus  Q(z)  has  a  p-th  order  zero  at  z=0,  but  is  not  equal  to  the  zero  matrix 
for  0<l^).  Of  course,  the  individual  elements  of  matrix  Q(z)  can  have  zeros 
anywhere . 


POLE  LOCATIONS  OF 

Since  from  (F-24) 


tf'w.-e-'QW, 


(F-29) 


where  Qp(z)  is  a  matrix  of  polynomials  in  z  of  order  p,  it  follows  that 

Q w  -  - *FQr&'  -  -  W'  (F'30) 

A 

where  ^  W  is  a  matrix  of  polynomials  in  z  of  order  (M-l)p.  Therefore 
the  poles  of  Q(z)  are  caused  by  the  zeros  of  det  Qp(z);  that  is,  the  poles 
of  W  are  caused  by  the  zeros  of  det  (*) .  As  lei***,  <«-rW  F-24); 
therefore,  Q(z)  ~  I  as  |e|-*<&,  50  that  Q(z)  has  no  poles  at  lal*®- 
Thus  the  poles  of  Q(z)  are  located  where  det  -  0- 

We  now  consider  the  problem  of  determining  when  det  ^%)*o '»  the 


following  derivation  is  based  upon  Ref.  7.  Let 


(F-31 ) 


x..., 


be  an  Mpxl  matrix.  Define  prediction 


i  -  cl 


(F-3  !) 
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where  C  is  MpxMp.  And  define  error 


where 


W«-X  =  X-  . 

•  (t.-cur-x:c") 

cv.cn 

*  %-%v:X  +  {c -v,v;')%{<:-mcy 

Qi  .  <)  H 

^  =  Jh  • 


( F-33) 


{ F-24 ) 


(F-25) 


u 

rhe  minimum  value  of  tr  S  is  realized  when  (see  appendix  B)  we  select 


C  -  %% 


(F-36) 


”he  corresponding  value  of 


U*  -  %-vxX -%-cus\ 


(F-37) 


_  i-j 

si'ce^slf,  .  Now  let  the  left  eigenvectors  and  eigenvalues  of  the  optimum 


C  be  denoted  as 


?„HC  mp- 

(The  eigenvectors  may  not  all  be  linearly  independent).  Then 

o  i  i*:tf  =  ?:  u:  t  -  ?:  x- cuc% 

=  ?X?„0-iua) 


(F-33) 


(F-39) 
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Now  is  Hermitian,  block  Toeplitz,  non-negative  definite,  and  has 
the  form 


v.  -  IX 


Xo  ‘  "Rf » 

K.\ 


( F-40) 


Therefore  |\wJs|  for  |3|*sMp;  that  is,  all  the  eigenvalues  of  C  are 
bounded  by  unity  in  magnitude.  Furthermore,  Ref.  7,  p.  134,  shows  that 
if  there  is  no  exact  linear  relation  between  the  elements  of  X„ }  X*., X*.r, 
then  I  <(  for  |sw«  Mp  (see  also  appendix  B). 

Now  we  develop  the  error  in  (F-33)  in  more  detail: 


s,-4-c* 


M 


X  1 

c.  -  c; 

V 

* 

• 

t - 

Minimizing  tr  can  be  seen  to  make  C  of  the  form 

A?  A?  "• 

TO  0 

C  *  I  0  r  0 

*  \  » 

.  * 

0  '  I  0 

Therefore  (Ref.  7,  eqs.  (35)  and  (36)), 


(c-  x  r)  *  (-  >)Mf 


(F-41) 


(F-42) 


( F-43) 


F-7 
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If  we  were  to  assume  that  MTftt.  o,  where  l*tj  i  I  ,  we  would  have  det(C-?,l) 
But  this  contradicts 1  for  |<wiMp.  Therefore,  the  zeros  of  det 
all  lie  inside  the  unit  circle;  that  is,  the  poles  of  Q(z)  all  lie  inside 
the  unit  circle. 


'-e 
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Thus,  the  one-step  extrapolated  correlation  matrix  estimates,  based  on 
order  p-1 ,  are  Hermitians  of  each  other. 


G-l/G-2 
Reverse  Blank 


TR  5501 


Appendix  H 


INTERRELATIONSHIPS  OF  U  AND  V  FOR  UNKNOWN  CORRELATION  CASE 

P  p 


We  develop  the  definition  (95)  as  follows: 


■  V.?  (At4-  A’J  B$1L  -  A*  ±bT|L,  (l  J  N)  <-*>»)) 

K-O  ^  ' 


(H-l) 


Therefore 


If*  I 


K  -  ,|BrX 


)r.Vr,.A>r[|A-X-^ 

,ur.-AXur. 


In  a  similar  manner,  we  can  show  that 


Vr  -Cx- BfA7)Vr 


(b  0’)) 


(tjM) 


( H-2 ) 


( H-3 ) 


( H  -  4 ) 


In  order  to  show  that  Up  is  Hermitian,  we  recall  the  constraint  (98) 


and  express 


TR  F  50 1 


Therefore  if  Up-i  ~  Up-i  aj^^p-r  ,  it  immediately  follows  that 


(H-5) 


V)PH  -  uf 


(H-6) 


Similarly  since 


it  also  immediately  follows  that 

V0M  =v 

Vp  Vp  • 

3ut  properties  (H-6)  and  (H-7)  are  obviously  true  for  p  =  0,  because 

u.-\-k-K 

Therefore  (H-6)  and  (H-7)  are  true  for  all  p,  by  induction. 

In  order  to  relate  the  determinants  of  U  and  V  ,  we  express  (--2 

P  p 

and  (H-4)  as 

V,  ■  »W-  B')ur,  ,  v, .  ur-ETXv,., 

Therefore  if  det  Up- 1  =  det  Vjp-  / ,  ^  e~\ 

Op  *  ^  V*  ; 

But  (H-10)is  obviously  true  tor  p=0  by  ( H - 3 ) .  Therefore  0-12)  is  true 
for  all  p,  by  induction. 

Properites  (H-6),  (h-7),  and  ( H - 1 0 )  applied  to  'OB'  immediately  snow 


(H-7) 

(H-8) 


:h-9) 


that 
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Appendix  I 

MINIMIZATION  OF  TRACE  OF  WEIGHTED  ERROR  MATRICES 

We  wish  to  minimize  the  trace  of  W 23 )gfc>y  choice  of  matrix  Gp.  We 
use  the  fact  that,  for  square  matrices  P  and  Q, 

ir(po)  -  *  Mop),  u-i) 

to  express 

+(v^v;:^v;:tu;:^V£9^-jiT,^*v;;  -u;  ^rrr)G>],  d-2> 

Now  (1-2)  is  an  analytic  function  of  the  variables  Re(G  )  and  Im(G  ). 

mn  mn 

Therefore  the  minimum  of  (1-2)  is  realized  simply  by  setting  the  coefficient 
of  Gp  equal  to  zero  (Ref.  20).  W^  obtain,  after  premultiplying  by  A..,  and 
post-multiplying  by  ,  the  equation  for  Gp: 


a^v,:  s-£s,  ■  5pv;;  r;  +  j;;  hi:  - 


(1-3) 


(Gp  is  not  Hermitian  or  Toeplitz,  as  numerical  examples  will  show.)  In 
terms  of  A^and  B^,  we  have  the  simultaneous  equations 


where  we  utilized  (122) 


0-4) 

A^.-U^bJ1"  *  0 


I-1/I-2 
Reverse  Blank 
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Appendix  J 


COMPUTATION  OF  FILTER  TRANSFER  FUNCTION 


The  forward  predictive  error  filter  transfer  function  is  given  in 


(68)  as 


Hf(f)  *-  ;  Fl<i 


Now  d  ivide  the  frequency  range  (-jL ,£■)  into  Np  cells  of  width 

A-  •  7^  ir  ■ 

Then  for  |*| 

4)  =  H? (v)  -  -  £  ?*?(-<  Af 


where 


-Af  ,  0  i  «sy 


H.  M 

Now  if  we  let  the  sum  in  (J-3)  be  denoted  as  an  Np-pomt  1:1 


(J-l) 


(J-2: 


2T e*r(-r7-n-».i/ 0  5  *  \ 


then  (J-3)  becomes 


t,  ,  0! 


\ 


Then  quantity  ^  in  (J-5)  is 


HD-R182  402 


SCIENTIFIC  RND  ENGINEERING  STUDIES;  SPECTRAL  EST I NATION 
<U>  NAVAL  UNDERNATER  SVSTEHS  CENTER  NEWPORT  RI 
A  H  NUTTALL  1977 


- - - T 

6/7 


UNCLASSIFIED 


F/G  7/4 


NL 


1.25 
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Appendix  K 

PROGRAM  FOR  SPECTRAL  ANALYSIS 


In  this  appendix  we  present  the  program  for  the  procedure  summarized 
In  (184)  -  (187)  and  (165).  The  spectral  estimate,  (165),  is  computed 
at  frequencies  f*n/(N* A)}  •* 

£r) "  *  ttjr)  ^  >  w  *  nf/2)  (k 


where  the  forward  predictive  error  transfer  function  ■ftjd  Is  given 
by  (J-6).  The  specific  scaling  adopted  is  based  upon  (166),  which  takes 
the  sampled  form 


(K-2) 


where  is  a  set  of  integration  weights  (e.g.,  trapezoidal).  The 
approximation  is  a  good  one  if  G^(f)  is  sampled  finely  enough;  that  is, 
if  Np  is  large  enough  to  resolve  the  peaks  and  valleys  of  G^P)(f).  If  we 
employ  (J-2),  (K-2)  becomes 

!*(fc)*\i  ( K-3A) 

or,  for  trapezoidal  weighting, 

*  K  ,  (MB) 

where  we  have  employed  the  periodic  nature  of  G^p^(f)  (See  (165)  and  (68)). 


K-l 
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'.hus  the  sum  of 


samples 


equals  the  sample  power,  (80). 


For  a  real  multivariate  process,  we  can  employ  (171);  a  modified  form 
emerges: 

s  £>  rea‘  prtce"-  /v., 

where  fw^\  is  another  set  of  integration  weights.  This  is  the  ^  sir- 
programmed  in  the  following;  the  quantities  computed  are 


'K-5 


The  real  part  of  their  weighted  sum  equals  the  sample  power,  c  .  Tie  re¬ 
used  here  to  evaluate  (J-5)  is  given  in  Ref.  21;  it  is  limited  to  powers  o* 
2,  but  could  be  replaced  if  desired.  Input  parameters  are  N,  FMA aid 
NF  in  line  22,  and  the  input  data  call  is  in  line  37  and  SUBROUTINE  DATA, 
all  these  quantities  have  to  be  changed  by.  the  user  to  fit  his  particular 
application.  The  program  is  written  for  a  real  multivariate  process 
(general  M) ,  with  the  exception  of  FUNCTION  DETERM,  SUBROUTINES  SOM, 
INVERT,  and  SOLVE,  and  the  printout  of  the  spectral  density  matrix,  (K-5). 
Arrays  used  in  the  program  are  explained  by  comment  statements.  A  sample 
printout  follows  the  program. 
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C  MULTIVARIATE  LINEAR  PREDICTIVE  SPECTRAL  ANALYSIS* 

c  employing  weighted  forward  ano  backward  averaging, 

C  THIS  PROGRAM  IS  WRITTEN  FOR  REaL  PROCESSES  AND  GENERAL  M,  WITH  THE 
C  EXCEPTION  OF  FUNCTION  DETERM  AnO  SUBROUTINES  SOM*  INVERT*  AND  SOLVE, 
C  AND  THE  PRINT  OUT  OF  THE  SPECTRAL  DENSITY  MATRIX. 

c  user:  change  LINES  22  ANO  37#  aNO  replace  subroutine  oata. 

CM:  DIMENSIONALITY  OF  MULTIVARIATE  PROCESS’  INTEGER  INPUT 
C  N  :  NUMBER  OF  DATA  POINTS  IN  EACH  PROCESS!  INTEGER  INPUT 
C  XU*1>...X(N*1)*...*XU*M)..,X(N»M)  :  INPUT  DATA!  ALTERED  ON  OUTpuT 
C  PMAX  =  MAXIMUM  ORDER  OF  FILTER |  INTEGER  INPUT 

C  NF  :  SIZE  OF  FFT  (MUST  BE  A  POWER  OF  2  TO  USE  MKLFFT) I  INTEGER  INPUT 

C  AVE  :  MEANS  OF  INPUT  DATA I  OUTpUT 

C  H  :  COVARIANCE  MATRIX  OF  INPUT  DATA!  OUTPUT 

C  AIC  :  AKAIKE'S  INFORMATION  CRITERION!  OUTPUT 

C  PSEST  =  BEST  ORBER  OF  FILTER!  INTEGER  OUTPUT 

C  UoEST  :  MATRIX  OF  COEFFICIENTS  IN  SPECTRAL  ESTIMATE!  OUTPUT 

C  AP  :  MATRIX  OF  FORWARD  PARTIAL  CORRELATION  COEFFICIENTS!  THEN  : 

C  MATRIX  OF  FORWARD  PREDICTIVE  FILTER  COEFFICIENTS  FOR  PBEST»  OUTPUT 
C  BP  :  MATRIX  OF  BACKWARD  PARTIAL  CORRELATION  COEFFICIENTS!  OUTPUT 
C  XX, YY  :  SPECTRAL  MATRICES!  OUTpUT 

PARAMETER  M:2  u  bivariate  process 

PARAMETER  N:  100  *  PMAX:  10*  NF:102w»  nF41=nF/4+1 

INTEGER  PBEST » P 

DIMENSION  X (N*M) * Y (N.MI*Z(N,M> , UBEST (M*M) , AP (h,M,PMAX ) , 
S6P(M,M,PMAX)*AVE(MJ  *XX(NF»M,M).YY(NF»M,M)  .C0SKNF41)  , 

SU (M. M) , V (M*  M ) *  UI (Mf Ml*  VI (M*  M ) ,A(M*M) *B(M*M) ,R(M,M) , 

S  W  A  ( M  *  M ) t WB ( M  *  M ) , WC l M  »  M ) , WO ( M , M ) *WE(M*M) * AIC (PMAX) ,AlC0(2> 
EQUIVALENCE  (X,Y)#  (AIC(l)#AlC0(2U 
C  PRINT  OUT  VALUES  OF  PARAMETERS 
I=N 

JSPMAX 

KSM 

lsnf 

PRINT  1*  I  * J*K»L 

1  FORMAT ( 1H1 *  *  N  :'»I6»10X»»PmAX  :*,U»10X»*m  s' , 12* iOX* *NF  :'*X5> 

c  input  data  in  x(1»i>...x(n»1)#...*x(1#m)...X(n,m) 

CALL  OATA 
PRINT  2 

2  FORMAT (/'  INPUT  DATA!*) 

JsN-99 

LSN-200 
DO  3  I=1»M 
PRINT  4*  I 

IF ( N.LE.200 )  60  TO  5 
PRINT  6,  (X (K* I ) ,KSl* 100) 

PRINT  7,  L 

7  FORMAT- !*»»•  INPUT  DATA  POINTS  NOT  PtINTED  HERE*) 

PRINT  (X (K, 1 ) »K?J»N) 

GO  TO  3 

3  PRINT  b*  ( X (M » I ) *K?1*N) 

3  CONTINUE 

4  FORMAT (»  PROCESS  NUMBER* *l2) 

b  FORMAT (5C20.B) 

C  EVALUATE  PARTIAL  CORRELATION  COEFFICIENTS 
CALL  PCC 
PRINT  • 


K-3 
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tt  FORMAT?/*  MEANS  OF  INPUT  DATA?*) 

PRINT  6*  (AVEd)  »  ISXrM) 

PRINT  9 

9  FORMAT?/?  COVARIANCE  MATRIX  OF  INPUT  DATA.**) 

PRINT  6,  ? ?R?I»J> »I=l#M) »J=1»M1 

PRINT  10 

10  FORMAT ?/¥  AKAIKE  INFORMATION  CRITERION? •> 

PRINT  11#  (P#AIC(P>#PS0»PMAX) 

11  FORMAT ( 110# £20*6) 

PRINT  12#  PBEST 

12  FORMAT?/*  PBEST  S*#I3) 

PRINT  13 

13  FORMAT (/»  UBESTJ*) 

PRINT  b,  ((UBEST(I#J)#Isl»M)»Jsi#Ml 
PRINT  14 

14  FORMAT?/*  FORWARD  PARTIAL  CORRELATION  COEFFICIENTS: * ) 

DO  IS  Psl#PMAX 

15  PRINT  16#  P# ??AP?I»J#P># !sl,M) #JS1»M) 

16  FORMAT? I10»6E20. 8) 

PRINT  17 

17  FORMAT?/?  BACKWARD  PARTIAL  CORRELATjON  COtFFIClENTs? * ) 

00  18  PslfPMAX 

18  PRINT  16#  P# ? ?BP(I«J#P) *lSlfM) #JS1»m) 

IF?PttEST/EQ.O>  60  TO  19 

C  EVALUATE  PREDICTIVE  FILTER  COEFFICIENTS 
CALL  PFC 
PRINT  20 

20  FORMAT?/*  FORWARD  PREOICTIVE  FILTER  COEFFICIENTS  F OR  PfaEST:’) 

LO  21  P=1#PB£ST 

4i  print  i6»  p» ? ?ap? i, j.P) , i=i,m) , j=i*y) 

c  evaluate  predictive-error  filter  transfer  function 

19  CALL  PEFTF 

c  evaluate  spectral  oensity  matrix  and  Coherence 

RsNF/2+l 
CALL  SUM 
PRINT  22 

22  FORMAT?/*  SPECTRAL  OENSITY  MATRIX  AnO  COHERENCE  For  v=2:'> 

PRINT  23 

23  FOHMAT?aX.»8!N*tl9X,!AUT011»#l4X,'AuT022,.10X, *«Eal < C*oSS12 ) \-*X» * 
$IMA6?CR0SS12) ’ »9X» *MA6  SO  COH* * UX# •ARGUMENT* ) 

PRINT  16*  (L#XX?L#1#1) ,XX?L,2*2) #XX?L»1*2) ,YY(L#1»2) #Yy?L»1*1) ,YY? 
f L * 2 # 2 ) #  LS1 #K) 

SUBROUTINE  ORTA 

C  THIS  SUBROUTINE  GENERATES  DATA  FOR  Ms2t  BIVARIATE  PROCESS 

OEF INE  1RAN08 1 *5** 15f ( | ItSIRN ( 1 » I •$••151 ) / 2 ) *34359738367 
DEFINE  RAN03FLOAT?l)/393ft97363t7, 

U5281 

TA=0, 

T»»C, 

00  1  KsitlOO  0  BILL  DISCABO  TtfESC  INITIAL  POINTS 
lslHANQ 

T».85*TAT.75fTB^RANO*.5 

ZsIRANO 
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TBs , 65*TA+ • 55*TBfRANP- . 5 

1  TA=T 
X(1#1)=TA 
Xa»2)=TB 
00  2  Ks2(N 
IslRANO 

Ts • B5*TA- . 75* TB^RANO* • 5 
IslRANO 

TBs , 65*TA+ . 58*TBtRANP- , 5 
TA=T 

X(K,l)sTA 

2  X(*'2)sTB 
return 

SUBROUTINE  PCC 

C  THIS  SUBROUTINE  COMPUTES  PBEST,  UBEST»  AND  THE  PARTIAL 
C  CORRELATION  COEFFICIENTS  FOR  P  r  I  TO  pMAX!  ANY  R 
UN 

JSPMAX 

IAs3«*SQRT (N)/M 

IF(PMAX.6TVIA)  PRINT  If  JfXfIA 

1  FORMAT (  / *  PMAX  s»»I4#*  IS  TOO  LARGE  FOR  NUMBER  OF  DATA  POINTS  |* 
S'U**’  SEARCH  LIMITED  TO  P  =*,I4) 

I ASM I N ( I A « PMAX )  S  UPPER  BOUNO  ON  PMAX I  EG  183 

FaCs2.*M*M/N  0  FACSO.  fcOULD  FORCE  PB£ST  EOUAL  TO  PMAX 

c  subtract  means »  fill  in  oata  arraysi  Eg  ho 

DO  2  ISi#M 
TA=0. 

DO  3  Ksl»N 

3  7  ASTA+Y (K» I ) 

7ASTA/N 
AVE(I)STA 
DU  2  KsitN 
Y(R.I)SY(R»I)-TA 

2  Z(K*I)SY(K»I) 

C  INITIALIZE  CORRELATION  MATRICES!  EOS  82*  114,  AND  105 
CALL  AUTO(2'N-l,Y'«C) 

CO  4  Isl»M 
DU  4  JSI»M 
TA=T(l,I)*Y(l»J) 

TB=Y(N,I)*Y(N»J) 

R(l» J)S(MC(I»J)+TAtTB)/N 
»A ( I , J) stfC ( I , J) +TB 
4B(I»J)sWC(Z»J)<fTA 
R(J,I)SR(X»J) 

4A ( J* I )SMA ( 1 , J) 

4  »blJ'I)3MB(I»J) 

CALL  EQUAL(RfU) 

CALL  EQUAL(R»V) 

CALL  CROSS (2»N»Y»Y«WC) 

C  BEGIN  RECURSION 

AXC(0ULOG(DETERM(U)) 

AICMXNSAXC(O) 

PBESTsQ 
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CAUL  EQUAL (U»UdEST) 

CO  5  PSiflA 

c  evaluate  matrices  required  in  bilinear  matrix  equation*  eq  126 

CALL  InVERT(V»VI) 

CAUL  MULT(VI»*4'*D) 

CALL  EQUAL (WO>  *ti) 

ChLL  IuVEKT(U»UI) 

CALL  EQUAL (*A»*0) 

CALL  MUlT(WO>Ul,WA) 

CALL  AOG(»C»WC'*C) 

C  SOuVE  BILINEAR  MATRIX  EQUATION*  EG5  157-lbl 
CmLL  solve 

u  evaluate  partial  correlation  coefficients*  eq  124 

CAUL  MULT(WC#VI»A) 

CALL  TRANSiWCtftO) 

'.ALL  MULTIWD»UI#B) 

CALL  EuUAL { A  #  AP ( 1 » 1  *  p } ) 

CALL  EQuAUb»bP(l#l.P)  ) 

C  UPDATE  MATRICES  U  AND  V*  Eq  18  j. 

CAU.  MULT(A#wOm»E) 

CALL  SuB(UrftE'U) 

Call  molT(B*»C»«E) 

CALL  SUH  (  V  m*E*  V  ) 

C  CALCULATE  AKA IkE * S  INFORMATION  CRITERION*  EQ  180 
AlC(H)SLOOlDETERMlu)  )-*-FAC*P 
Ir (AIC(P) .GE.AICMIN)  GO  TO  6 
AlC*lN=AICtP> 

PdtSTsP 

CALL  EQUAL (U#Ud£ST) 
o  If IP.EQ.IA)  GO  TO  5 

C  UPOATt  J A  T  A  SEQUENCES  Y  ANQ  Zl  EQ  111 
L=P+1 

LO  7  K=N»L»-1 
CU  8  ISI#M 
TA=Z(K-I.I) 

CO  9  USl.M 

9  T m=TA-d ( 1 »U) *Y (K #U) 

o  2 i K » 1 ) ST  A 

CO  10  1=1 i M 
TA5Y(K#I> 

lU  11  U=1 *  4 

11  Tm=TA-A( I»U)*Z(K-1# J) 

10  Y ( K i I ) STA 

7  CONTINUE 

C  CALCULATE  NEW  CORRELATION  MATRICES*  EQ  Uh 
call  AuTO(p42»N* Yf«A) 

CALL  AUTO(P4l#N—l»Z»wB) 

CALL  CROSS {P^2  *  N »  Y  ♦  Z »  WC  ) 

=  CONTINUE 

lF(M.EQ.l)  RETURN 
a=M-1 

CO  12  1  =  1** 

L=i*l 

DO  12  U=L*M 
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UtJES T(I»J)=,5*( UBEST ( I » J ) ♦UqEST (J»I)  ) 

12  UBtST(J»I>=UBEST(I,J> 

RETURN 

C 

SUBROUTINE  PFC 

C  THIS  SUBROUTINE  COMPUTES  THE  PREDICTIVE 
C  FILTER  COEFFICIENTS*  ANY  Ml  EQ  79 
IF(PBEST,LE.1>  RETURN 
DO  1  Ps2»PBEST 
I A=P-1 
DO  2  LZI’IA 
Ib=P-L 

CALL  MULT(AP(1#1»P>  # BP (1,1,  jB)  »*A) 

CALL  SUB(AP(1,1,L)  ,InA,wA) 

CmLL  MULT(BP(1»1»P>  #AP(1«1*l)  »*B) 

CALL  SUB (BP (1,1, 10) ,BB,BP(1,1,IB)) 

2  CALL  EOiiAL(WA,AP(l,l,L)  ) 

1  CONTINUE 
RETURN 

L 

SUBROUTINE  PEFTF 

c  this  subroutine  computes  the  predictive-error 
C  FlLTtR  TRANSFER  FUNCTION*  ANY  m*  EQS  66  AND 
Ksl . 4427*L0G ( NF ) 5 
CALL  OTRCOS(COSI,NF) 

CO  1  1=1, M 
CO  1  J=1»M 

ax (i,x , j) =0 • 

IF  ( I •£<■>•  J)  XX(l,I,d)=l. 

Y  r (1, 1, J)=0. 

IF(PBESTaEQ.O)  go  TO  2 

U=PBESm 

DO  3  L=2,IA 

XX  (L,  I ,  J)=*AP(  I ,  J,L«*1 ) 

3  YYlL,I,J)=0. 

2  lA=PBEST*2 
CO  4  l=ia,nf 
XX (L* I , J)=0 • 

4  YY(L,I,J)*0. 

1  CmLL  MKLFFT(XX(l,I,Ji,YY<l,I»J) ,C0SJ,K,-1) 

RETURN 
C 

SUBROUTINE  SOM 

C  THIS  SUBROUTINE  COMPUTES  THE  SpECTRAL  DENSITY 
C  MATRIX  A NO  COHERENCE  FOR  Ms2*  IQS  176  A NO  K-5 
TsE./NF 
DO  1  L=1,X 
4A (1, 1)=XX(L,2,2) 
i»A  ( 1 , 2)S“XX  (L,  1 ,2) 

WA (2, 1 ) r*XX (L,2, 1 ) 

MA(2,2)sXX(L,l,l) 

»B(l,l)sYY (L,2,2) 

40(1,2)s-YY(L»1,2) 

40(2, l)s-YY (L,2, 1) 

*B(2,2)=YY(L,1,1) 
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TA=OET£RM<*A)-OETERM<*B) 

TB=WAU»i)*WB(2»2)tWA<2#2>*wBU#l)-WA(l»2)*W0(2»i>' 

TA=T/(TA**2+TB**2> 

CALL  TRANS  (WA»«»C) 

CALL  MUlT<UBEST#WC>WO) 

CALL  MULTl*B»WU»*C) 
lb=WC(l»2)-WCl2# 1) 

CALL  MulT(WA#*0»#CI 
CALL  TRANS(*B*ttO) 

CaLL  MUlT(UBEST»wO»w£) 

Call  MULTUb»te£»wO) 

CALL  AOO  < »»C  t  *D »  *C  ) 

YY(L#  1»1)S{WC<1#2)**2+TB**2)/(*C(1»1>**C(2>2)  )  » 

Y Y (L»  2*  2) =ATAN2 { Td »  #C ( I >  2 ) }  « 

XX(L»l»l)=TA*mC(lrlJ  W 

XXlLf2»2>=TA*WCl2»2> 

XX  tL’ 1 » 2 )  STA*teC  (1  »2)  <0 

YY (L* 1#2)STA*TB  S 

XX(L»2»1)S0. 

YY(L#2.i)=0. 

1  CONTINUE 
RETURN 

C 

SUBROUTINE  CROSS ( N1 » N2 »  a i B » c )  Q  A  *  B »  A  N6 

c  this  subroutine  computes  a  cross  correlation  matrix » 

LIMENSION  A(N»M) 

LOUBLt  PRECISION  T 
1)0  1  1  =  1  frt 
C.0  1  J=1#M 
T=0.D0 

LO  2  K=Nl#N2 

2  T=T*A(X, I>*B(K-1.J) 

1  C(I»J)=T 
RETURN 

C 

SUBROUTINE  AUT0(Ni»N2»A»BJ  8  A»  A  NG 

u  this  subroutine  computes  an  auto  correlation  matrix) 

LIMENSION  A(N»M) »BIM»M) 
lGUBLE  precision  t 

LO  1  I— 1 #M 
LO  1  J=I *M 
T=O.OC 

LO  2  KSN1»N2 

2  7=T*A(K,  I)*A(K#U> 
t(I.J)=T 

I  d  i  U»  I )  =B  ( I  *  J) 

RtTURN 

C 

SUBROUTINE  EQUAL(A#8) 

L  This  subroutine  sets  t*o  mxm  matrices  equal 

LIMENSION  A(M»M) *falM»M) 

LO  I  I=1#M 
LO  1  J=1»M 
i  h(LJ):AII»J) 

RETURN 
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fcA(2»l)*W0(i»2) 


MAG  SQ  COH 

argument 

AUTOll 

AUT022 

REAL(CROSSl2) 

iMAG(CROSSla) 


any  mi  eq  Hub 
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SUBROUTINE  TRANS (At B1  0  At  A  NG 
C  THIS  SUBROUTINE  TRANSPOSES  AN  MXM  MATRIX 
DIMENSION  A(MtM) tB(MtM) 

CO  1  I=ltM 
DO  1  J— 1 »M 
1  B(ZtJ)sA(JtI) 
hETURN 
C 

SUBROUTINE  AOU(AtBtCl  0  AtBt A  OK 

C  THIS  SUBROUTINE  AODS  TWO  MXM  MATRICES 
DIMENSION  A(MtM) tB(MtM) ,C(M,M) 

DO  1  1=1 1 M 
DO  1  J=ltM 

1  C(It J)=A(It J)+B(ItU) 

RETURN 

C 

SUBROUTINE  SUB(AtBtC)  Q  AtBt A  OK 

C  THIS  SUBROUTINE  SUBTRACTS  TWO  mXM  MATRICES 
DIMENSION  A(MtM) tB(MtM) ,C(M,M) 

00  1  I=ltM 
DO  1  J— 1 tM 

1  C ( I t J)=A(It J)-B(ItU) 

RETURN 

C 

SUBROUTINE  MULT ( AtBtC)  0  A tB t A  NG 

C  THIS  SUBROUTINE  MULTIPLIES  TWO  MXM  MATRICES 
DIMENSION  A(MtM}tB(MtM)'C(M,M) 

DO  1  1=1 1 M 
DO  1  J=ltM 
T=0. 

DO  2  K=ltM 

2  T=T4-A(ItK)*B(KtJ) 

1  C(ItJ)=T 

RETURN 

C 

SUBROUTINE  INVERT (AtB)  (J  AtA  NG 
C  THIS  SUBROUTINE  INVERTS  A  2X2  MATRIX 
DIMENSION  A(2t2) tB(2t2) 

TA=1./DETERM(A) 

B(ltl)=A(2t2)*TA 
D(2t2)=AUtl)*TA 
b ( 1 1 2 ) =-A ( 1 1 2 ) *  T A 
B(2tl)s-A(2tl)*TA 
RETURN 
C 

SUBROUTINE  SOLVE 

C  THIS  SUBROUTINE  SOLVES  BILINEAR  MATRIX  EQUATION 
C  FOR  M=2t  BIVARIATE  PROCESSl  EQS  157 t  I58»  AND  162 
TA=WA(l,l)+ftA(2t2)+WBflt  lJ*wB(2t2> 

TB=DETERM ( W A ) -DETERM ( WB ) 

CALL  MULT(WCtWBtWD) 

*E  ( 1 1 1 ) =WA (2t 2} 

*E ( 1 1 2 ) :-W A ( 1 1 2 ) 

*E ( 2 1 1 ) :-W A ( 2 1 1 ) 

WE(2t2)sWA(ltl) 
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call  mulT(v»e»v*c,wa) 

CALL  AQD(toD'WA'WD) 
Adll#l)=TA»tfBU»l)tT& 
#b(2»2)=TA*WB<2»2)4’Tfc 
Wd  ( 1 » 2)  =TA*1wB  ( 1 » 2) 

*d(2#l)=TA**B<2»lJ 
CALL  INVERT ( WB»  w£) 

CALL  MULT(WD#WE» WC) 

RETURN 

C 

FUNCTION  OETERM(A) 

c  this  function  computes  the  determinant  o~ 

DIMENSION  A(2»2) 

DclERM=A(i,i)*A(2>2)-A<if2;*A(2/: 

RETURN 

Eiw 

SUdrtOUTINE  MKLFrTU*  Y»CC»M, jSN' 

DIMENSION  X(l),Y(i)»CC(l)»L(12: 

EQUIVALENCE  (L12*L»ii),  (L11,L(2)  -  .  vL I0#L! J; 
i  l  L?  *  L  ( 6 )  )  »  l L6 » L  ( 7 )  i  »  ( L5  >  L 1 6 )  >  »  v L*+ , '  9 ;  ;  #  :  L  j 
2(Ll»L(12i  ;• 

N04=N/4 

(«D4Pl=N04+i 

U04P2=N04Pl+i 

r.U2P2=ND4+N04P2 

CO  8  LO— 1 t M 

LMA=2**(M-L0) 

LIX=2*LMX 

iscl=n/lix 

DO  8  LM=1»LMX 
IAKG=(LM-1)*ISCl+1 
IF(IAR0.LE,NB4P1)  GO  TO  4 
C=-CC(ND2P2-IARG) 

S=I5N*CC(IARG-ND4} 

GO  TO  6 
4  C=CC(IARG) 

S=ISN*CC{ND4P2-IARG} 
o  DO  ij  LI=LIX#N#LIX 
J1=LI-LIX+LN 
J2— Jl+LMX 
T1=X(J1)-X(J2) 

T2=Y(J1)-Y(J2) 

X(J1)=X(U1)*X(U2) 

Y  i  Jl )  =  Y  { Jl)  -f  Y  ( J2 ) 

X(U2)=C*T1-S#T2 
YtU2)=C*T2+S*Tl 
8  CONTINUE 
CO  40  Jrl *  12 
L(JI=1 

IF(J-M)  31*31*40 
SI  L(U)=2**(M4>1-U) 
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40  CONTINUE 
JU=1 

00  60  dl=l»H 
00  60  U2=J1»L2»L1 
Dv  60  J3=J2»L3,L2 
CO  60  J4=J3*L4»L3 
00  60  J5=J4,|_5»L4 
00  60  U6— J5rL6»L5 
00  60  J7=J6#L7,L6 
00  60  J8=J7»L8»L7 
00  60  u9=J8>L9#L8 
00  60  J10=J9»L10»l_y 
00  60  Jll=J10>Lll»L10 
UO  60  JR=Jli»L12»LU 
IMJN-uR)  51*51*52 

51  h=X(JN) 

A ( JN)=X( JR) 

X (JR) SR 
FI=T(JN) 

Y(JN)=Y(JR) 

Y (JR)=FI 

52  JN=JN+1 
60  CONTINUE 

RETURN 

ENO 


SubROUTINE  GTKCOS(C'N) 
01 MENS I ON  C(l) 
N4l=N/4+l 
SCL=6.283185307/N 
00  1  I=1*N41 

1  C(1)=C0S(  (I-1)*SCU 

RETURN 
ENU 
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aumkl  inf  ohm  a  t  ion  criterion: 

0  .261567:>2+01 

1  -,4711bl07+01 

2  -.46664973+01 

3  -.46229663+01 
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FORTRAN  Program  for  Linear 
Predictive  Spectral  Analysis  of  a 
Complex  Univariate  Process 


Albert  H.  Nuttail 
ABSTRACT 


A  FORTRAN  program  for  evaluating  (1)  the  linear 
predictive  complex  filter  coefficients  and  (2) 
the  power  density  spectrum  of  a  complex  univariate 
process  is  presented,  and  its  use  Is  demonstrated. 

A  pitfall  of  using  this  approach  for  estimating  the 
cross-spectrum  of  two  real  processes  is  pointed 
out,  and  a  limitation  of  the  complex  predictive 
filter  for  waveform  estimation  is  considered. 
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FORTRAN  PROGRAM  FOR  LINEAR  PREDICTIVE  SPECTRAL 
ANALYSIS  OF  A  COWLEX  UNIVARIATE  PROCESS 


1.  INTRODUCTION 

Spectral  analysis  of  a  coaplex  univariate  process  via  linear  pre¬ 
dictive  and  aaxiaua  entropy  techniques  is  considered  in  reference  1, 
and  Fortran  prograas  for  real  data  are  presented  there  in  appendix  J. 

In  this  report,  we  present  a  prograa  for  handling  the  case  of  coaplex 
data,  yielding  as  an  output  the  auto-spectrua  of  the  process.*  Coaplex 
data  can  be  encountered,  for  exaaple,  when  a  narrowband  real  process  is 
coaplex- demodulated  to  a  low  frequency  and  sampled  at  a  rate  cooparable 
to  the  bandwidth  of  the  process.  When  the  new  center  frequency  is 
zero,  the  process  is  called  the  coaplex  envelope. 

In  section  2,  an  exaaple  of  the  use  of  the  prograa  is  presented, 
and  the  changes  that  the  user  aust  aake  for  his  application  are  pointed 
out.  In  section  3,  the  possibility  of  using  this  prograa  to  estiaate 
the  cross- spectrua  of  two  real  processes  is  investigated  and  found  to 
be  undesirable.  In  section  4,  a  t imitation  of  the  coaplex  predictive 
filter  for  coaplex  wavefora  estimation  is  considered,  and  a  possible 
generalization  is  indicated  to  alleviate  the  problem. 


•The  theory  and  notation  for  this  case  were  developed  fully  in 
reference  1  and  will  not  be  repeated  here,  for  sake  of  brevity;  the 
reader  is  referred  to  that  earlieT  material  for  all  details. 
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2.  USE  OF  PROGRAM  FOR  SPECTRAL  ANALYSIS 


The  program  for  spectral  analysis  of  a  complex  process  consists  of 
five  parts:  a  main  program  and  four  subroutines,  as  listed  in  appendix  A. 
Input  parameters  to  the  main  program  (listed  in  statement  15)  are 

N,  number  of  complex  data  points, 

PMAX,  maximum  order  of  filter  considered,  and 

J,  size  of  FFT  for  spectral  estimate. 

The  sample  program  generates  a  data  example  in  lines  20-33  and  must  be 
replaced  by  the  user  to  fit  his  particular  applications. 

A  sample  output  for  N  =  100,  PMAX  *  10,  is  presented  below.  It 
indicates  that  PBEST  =  1,  which  agrees  with  the  actual  value  of  p  (see 
statement  21  of  the  main  program).  The  fractional  powers  sum  up  to 
0.99999829  instead  of  1;  the  difference  is  a  measure  of  whether  the 
spectral  estimate  has  been  adequately  sampled  in  frequency.  (If  the 
error  is  too  large,  J  may  be  increased.) 

Since  the  autospectrum  of  a  complex  process  is  real,  but  not 
necessarily  even,  it  is  necessary  to  compute  the  spectrum  over  both 
negative  and  positive  frequencies.  Thus  bin  1  corresponds  to  zero  fre¬ 
quency;  bin  J/2  ♦  1  corresponds  to  ^  Nyquist  frequency,  *_1/(2A);  and 
bin  J  corresponds  to  frequency  -1/(J£),  where  A  is  the  sampling  inter¬ 
val  in  time. 

An  example  of  a  spectral  estimate  of  1000  samples  of  a  complex 
envelope  of  surface-bottom  forward  scatter  at  a  20°  grazing  angle  at 
frequency  750  Hz  over  a  20  nautical-mile  path  is  presented  in  figure  1, 
where  the  sampling  rate  is  1  Hz.  There  is  observed  to  be  a  pair  of 
spectral  peaks  at  *_l/4  Nyquist  frequency,  a  strong  very  low  frequency 
component,  and  a  rather  symmetric  spectrum  about  zero  frequency.  In  fig¬ 
ure  2,  the  direct  path  is  employed  instead,  the  sampling  rate  is  0.1  Hz, 
but  1000  samples  are  still  used.  The  center  portion  of  the  spectral 
estimate  reveals  a  double  peak  near  zero  frequency  and  a  rapid  drop-off 
away  from  this  frequency.  With  these  few  data  points,  resolution  capa¬ 
bility  of  this  quality  is  very  hard  to  achieve  by  any  other  spectral 
analysis  techniques. 


IMMI  iMlfcX 
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Figure  2.  Spectrum  of  Direct  Path 
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3.  ESTIMATION  OF  CROSS-SPECTRUM  OF  TWO  REAL  PROCESSES 

Suppose  that  processes  u(t)  and  v(t)  are  real,  zero-mean,  and  sta¬ 
tionary.  If  we  form  the  complex  process 

x(t)  =  au(t)  ♦  Bv(t),  (1) 

where  a  and  8  are  complex,  then  the  autocorrelation  of  x(t)  is 

RxxCt)  h  x(t)x* (t  -  t)  =  R*xC-t) 

=  |a|2R  (t)  ♦  J  6 1 2R  (t)  ♦  a8*R  (t)  +  a*8R  (-t),  (2) 

I  I  uu  V  y  II  VV  UV  UV 

where  the  crosscorrelation  of  u(t)  and  v(t)  is 

Ruv(x)  =  u(t)v(t  -  T)  .  (3) 

The  auto-spectrum  of  x(t)  is  the  nonnegative  real  (noneven)  function 

oo 

GxxCf)  =  t  dT  exP  (-i2irft)R  (t) 

-00 

=  |a|2G  (f)  ♦  I S I 2G  (f)  ♦  a8*G  (f)  ♦  o*8G*  (f),  (4) 

J  I  uu  ^  II  v V  UV  UV 

where  the  cross -spectrum  of  u(t)  and  v(t)  is 

Guv(f)  =  "  dT  exp  (-i2irfT)Ruv(T)  =  G*y(-f) .  (5) 

•  ao 

Now  let  us  decompose  the  cross-spectrum  as 

Guy(f)  *  R(f)  ♦  il(f).  (6) 

for  which  (5)  yields 

R(-f)  =  R (f) ,  I(-f)  =  -  1(f).  (7) 
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Utilizing  (6)  and  (7)  in  (4),  we  obtain 

G  (f)  =  !a|2G  (f)  +  | S 1 2G  (f) 

XX  uu  1  1  vv 

+  2Re(a8*)R(f)  -  2Im(a6*)  I  (f) , 

G  (-f)  =  !  ct  [  2G  (f)  +  (S(2G  (f) 

XX  1  uu  11  vv 

♦  2Re(a6*)R(f)  ♦  2Im(otS*)  I  (f) .  (8) 


Solving  (S)  for  R(f)  and  Iff),  the  real  and  imaginary  parts,  respec¬ 
tively,  of  cross-spectrum  G  .(f),  we  obtain 


R(f’  -im hr- 


[(Wf) 


G  J-f)  - 


2  I  *  1  " 


XX 


G  (f) 
uu 


V  J 
-  I  - 


'G  CO] , 

VV 


1(f) 


1  1 
4  Im(a£*) 


rn 


Gxx1-f53' 


(9) 


if  Re(a£*)  *  0  and  Im(u£*)  #  0.  Since  a  and 
a£*  =  (1  -  i)/2,  for  which  £•-  =  'a£*'- 
choose  =  '  £ !  *■  =  l/*2;  then  (9)  becomes 


are  arbitrary,  we  choose 
1/2.  For  simplicity,  we 


Rffl  = 


G  (f) 
xx 


G  ( -  f  ) 

xx 


G  (f) 
uu 


Gvv(0 


=  EVE.\{  G 


xx 


(  t ) 


f) 


uu 


G  (ft 

VV 


G  ( f)  -  G  (-f) 

1(f)  »  — - ^ — — -  =  ODD{Gxx(f)}.  (10) 


Now  we  need  only  compute  R(f)  and  1(f)  for  f  _>  0,  as  (“)  indicates. 

An  alternative  method  to  (10)  was  presented  in  reference  2,  equa¬ 
tion  (4).  However,  that  method  required  calculating  four  auto-spectra, 
whereas  the  current  method  requires  calculating  only  three  auto-spectr3 
Guu(f)  and  Gvv(f)  are  real  and  even,  whereas  Gxx(f)  is  real,  but  not 
necessarily  even. 
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x(t)  =  a[v(t)  ♦  u(t)]  ♦  ib[v(t)  -  u(t)],  (14) 

which  is  the  preferred  form. 

For  known  autocorrelations  or  auto-spectra,  (10)  furnishes  a  valid 
way  of  calculating  the  real  and  imaginary  parts  of  the  cross-spectrum 
of  real  processes  u(t)  and  v(t).  However,  when  the  auto-spectra  are 
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unknown,  spectral  estimates  must  be  substituted  in  (10).  Although  R(f) 
and  1(f)  can  both  take  on  positive  or  negative  values  in  any  frequency 
range,  there  is  a  constraint  on  their  magnitudes.  Namely,  the  magni¬ 
tude-coherence  is  upper-bounded  by  unity.  However,  several  numerical 
examples  using  the  programs  in  appendix  A,  for  cases  where  the  true 
magnitude-coherence  was  near  unity,  yielded  estimated  magnitude-coher¬ 
ences  greater  than  unity  in  some  frequency  ranges.  This  was  traced  to 
the  fact  that  the  estimate  of  Gxx(f)  can  be  too  small  and/or  the  esti¬ 
mates  of  Guu(f)  and  GVv  (f)  can  be  too  large.  This  type  of  coherence 
estimate  is  intolerable;  hence,  estimation  of  cross- spectra  of  real 
processes  by  means  of  the  auto-spectrum  of  a  complex  process  is  dis¬ 
couraged.  The  same  conclusion  is  offered  for  the  method  in  reference  2. 
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4.  A  LIMITATION  OF  COMPLEX  PREDICTIVE  FILTER 


The  theory  behind  the  program  presented  in  appendix  A  has  been 
given  in  reference  1  and  is  based  on  a  linear  predictive  technique. 
Specifically,  given  p  past  values  of  complex  process  (x^),  a  linear 
one-step  prediction  of  xj<  is  attempted  according  to  reference  1,  equa¬ 
tion  58: 


P 

T  a  x. 
n  k-n 

n«l 


(15) 


If  we  express  all  the  quantities  in  (15)  in  terms  of  their  real  and 
imaginary  parts  according  to  definitions 

\  ‘  fik  *  *V  xk  ■  uv  *  ‘V  ■  a„  *  ‘V  (16) 


then  (15)  can  be  expressed  as 

P 

Uk  *  ^■[anuk-n  "  6nVk-n^ ' 
n«l 


vk  '  1,  <enVn  *  Vk-n>- 

n*l 


d~) 


But  (1")  is  not  as  general  as  the  form  for  prediction  given  by 
(un>  vn>  ®n»  an  real): 

P 

u,  *  y  (U  U,  ♦  V  V.  )  , 
k  L,  n  k-n  n  k-n 
n«l 


'k  *  *  Vk..1' 

n»l 


(18) 


It  is  apparent  that  the  mean-square  errors  of  predictions  in  (18)  could 
be  made  smaller  than  those  in  (l”),  in  general. 

The  complex  estimate  of  x^  that  can  be  formed  from  (18)  is 

P 

\  *  **k  *  ^  (gnXk-n  *  hnXk-n)’  (19) 

n«  1 
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where 


g 


n 


i[u  ♦ 

21  n 


a  ♦ 
n 


i(6n 


h  * 
n 


a 

n 


1 (S  *  v  )]. 

n  n 


CO', 


(The  form  T9'  is  the  one  alluded  to  in  reference  1,  footnote  to  equa¬ 
tion  0  58")  .9  The  coefficients  {gn  and  {hn'-^  in  T9''  are  two  com¬ 
pletely  independent  sets  of  complex  constants  that  can  be  chosen,  -Tree 
the  complex  predictive  filter  in  C5't  is  obviously  a  special  case  cr' 
T9),  it  is  expected  to  have  a  more  limited  abilit;.  in  ..aver  r~  predic¬ 
tion  than  T9';  however,  .  151  ma\  suffice  for  spectral  r- i  -  \ . -at  i  :r. 
purposes.  'For  a  real  process,  ■  1 Q'!  reduces  to  TS  . 


Now  suppose  that  a  pair  of  real  processes 
according  to  p-th  order  autoregressions, 


^ <w  w  u a  . 


generate. 


n*  1 


w  u. 
n  k-n 


.  v.  *  d, 
n  k -n  k 


i  (O  u  ♦  1  v  :  *  b,  , 
n  k-n  n  k-n  ►. 


n*  1 


where  all  the  quantities  are  real.  Then  complex  process 

P 

X  3  u  ,♦  IV,  *  ;g  X,  ♦  h.  X.*  ♦  ... 

k  k  k  “,'nk-n  r.N-n  r 
n=  1 


wihere  gn  and  hn  are  given  by  (20),  and 

Wk  ^  dk  *  lbk‘  :3 

Now  if  an  =  -un.  fn  *  vn,  for  1  ^n  _^p  in  (21',  we  get  gr  =  hr  =  _r 
*  i.n,  and  22'  vields  autoregression 

P 

*,  *  7  h  x*  *  w.  .  .1 

k  - ,  n  k -n  k 
n  *  l 
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So  a  linear  prediction  according  to  (15)  is  not  expected  to  do  too  well 
on  the  actual  waveform  values  given  by  (24),  no  matter  what  the  statis¬ 
tics  of  (wjj)  are.  However,  the  spectral  approximation  available  via 
(15)  of  Gx(f)  for  process  (24)  can  be  very  good  if  p  is  chosen  large 
enough  in  (IS). 


As  an  example,  let  p  «  1  in 

(24): 

X  m 

*k 

hxk-l  *  V 

(25) 

where  h  and  w^  are  complex  with 

IM  <  i. 

(26a) 

* 

k  k-n 

5  ,  w,  w,  »  0. 

on*  k  k-n 

(26b) 

The  excitation  process  described  in  (26b)  is  an  analytic  process,  as 
witnessed  by  the  zero  value  for  the  second  ensemble  average.  We  find 
averages 


Vk-n  ’  °>  Vk-„  *  4 on'  n  i  °- 


(2?) 


and  correlations 


R  =  x.  x; 
n  k  V-n 


1  -  h 


I h 1  ,  n  »  0,  2,  4, 
0,  n  ■  1,  3,  5, 


(28) 


ft  =  x.  x, 
n  k  k-n 


1  -  h 


0,  n  -  0,  2,  4,  . 

! h | n” 1 ,  n  -  1.  3.  5, 


(29) 


Since  (29)  is  not  zero  for  all  n,  process  {xj;}  of  (25)  is  not  an  anal¬ 
ytic  process. 

If  we  use  the  facts  (derivable  from  the  definitions  above)  that 

R  ■  R*,  ft  •  ft  ,  (30) 

-n  n  -n  n 
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we  find  that  the  spectra 

G(f)  =  a  \  R  exp  (-i2wfni) 
n*-«  n 

1  ♦  1  h  j  2 

■  A  - 5 - 1 11 - r-  . 

!  1- ! h  |  exp  (-i2*f2 L) 

and 

cm 

6(f)  :  i  J  exp  (-i2'!rfnA) 


31 


,  t  _ 2h^cos  .  3: 

| 1  -  I h | ‘  exp  (-i2»f2i) 1 * 

Generally,  spectrum  G(f)  is  real  and  positive  and  C  f'  is  complex 
and  even  about  f  =  0.  For  this  particular  example,  23',  If  is  also 
even  about  f  *  0. 

If  we  attempt  prediction  on  the  process  23i  according  to  13  an 
we  minimite 


we  find 


and 


A 

a 

n 


0 ,  n  *  l 
h  “ ,  n  *  2 


I A  ,2 

min  I  c.  I 
k 


(1  -  'hi2)*1,  p  .  1 

i  ♦  V  \  p  >  : 


Now  (341  is  hardly  the  same  result  as  the  actual  autorei;res>K'r. 
Nevertheless  we  find  spectral  approx ima t i on 


34 


;» 
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(1(f)  ■ 


Ml  -  |h|2)-1,  p  ■  1 
C(f).  p  >  2 


from  (34);  that  is,  the  spectral  approxiaation  is  exact  for  p  _>  2,  de¬ 
spite  the  obvious  error  of  (34)  in  terms  of  waveform  prediction. 

If  instead  we  attempt  prediction  on  process  (25)  according  to  (19) 
and  we  minimize 


V  5  !xv  -  V 


we  find,  of  course. 


i_  ■  °.  h 
n  n 


0,  n  4  1 
h,  n  »  1 


min  e. 


For  1  h  | 2  near  1,  the  error  (35)  for  p  >_  2  is  approximately  twice  as 
great  as  (39). 

For  the  general  autoregression  in  (22),  it  is  shown  in  appendix  B 

that  for  analytic  white  noise  (w^),  {hn}^  »  0  if  and  only  if  {#{„}£  •  0. 

Thus,  given  a  complex  data  sequence  (xn}1!1  of  unknown  origin,  we  can  de¬ 
fine  1 

N 

A  1  r 

(R  =  - -  7  xx  ,  0  <  m, 

■  N  -  *  n™-i n  "•*  - 


fc  •  i 

o  •  N 


'f  I  l*J2- 


Then,  if  <<  1  for  0  <  m  <_  q,  the  autoregressive  model  (19)  with 

(h„}j  ■  0  can  be  used  with  some  confidence  for  p  <_  q  to  predict  the  ac¬ 
tual  waveform.  But,  even  if  some  ^  f  0,  the  autoregressive  model  (19) 
with  (hn)P  ■  0  can  still  be  used  to  estimate  the  spectrum  of  the  process 
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{xjj};  however,  in  order  to  attain  equivalent  spectral  estimates,  it  has 
been  observed  that  more  data  points,  N,  are  needed  when  some  hn  ^  0 

than  when  fhn}^  s  0. 

If  a  process  is  generated  according  to  autoregression 


S'nVn  *  V 

n*l 


where  process  -'w^'  is  not  analytic,  then  {Rm}  are  not  necessarily  :ero. 
Thereby  prediction  US)  will  not  necessarily  give  accurate  predictions, 
although  the  spectral  estimate  can  still  be  adequate;  this  situation  is 
discussed  further  in  appendix  C.  Generation  of  analytic  processes  is 
considered  in  appendix  0,  and  a  more  thorough  look  at  the  prediction 
capability  of  ( 19)  is  considered  in  appendix  E. 
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5.  DISCUSSION 


A  program  for  estimating  the  autospectrum  of  a  complex  univariate 
process  via  linear  predictive  techniques  has  been  presented.  Although 
it  can  be  used  to  estimate  the  cross -spectrum  of  two  real  processes,  it 
is  not  recommended  because  estimated  values  of  magnitude-coherence 
greater  than  unity  can  result.  Instead,  the  methods  of  multivariate 
techniques  presented  in  reference  3  should  be  employed;  in  fact,  the 
theory  for  complex  multivariate  processes  is  developed  there  and  a 
working  program  given. 

Although  the  program  presented  here  presumes  that  none  of  the  data 
points  are  bad,  it  may  be  readily  generalized  to  include  bad  data 
points.  The  method  and  program  presented  in  reference  1  furnish  the 
necessary  background  for  this  extension. 

Application  of  the  linear  predictive  technique  in  (15)  is  most 
successful  when  the  complex  process  under  investiagtion  is  analytic. 
Otherwise,  the  more  general  prediction  technique  in  (19)  is  worthy  of 
consideration. 
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Appendix  A 

FORTRAN  PROGRAM  FOR  SPECTRAL  ANALYSIS 


The  following  program  for  spectral  analysis  of  a  complex  process 
consists  of  five  parts:  a  main  program  and  four  external  subroutines. 
The  subroutine  BURGCX  computes  the  complex  predictive  filter  coeffi¬ 
cients,  POWERC  computes  the  fractional  power  in  bands  (JA)-1  Hz  wide, 
MKLFFT  effects  a  fast  Fourier  transform  (reference  4) ,  and  QTRCOS  gen¬ 
erates  a  table  of  cosine  values  (reference  4) . 
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v.  special  est imation  for  complex  data 
c  usef :  change  line  is  and  replace  lines  20-33 

C  N  =  uLMBEm  of  COMPLEX  DATA  POINTS!  iNifGER  INPUT 
c  X  ( 1 ) ,  .  , , »  a  1 N )  =  complex  input  datai  altered  or.  OUTPU1 
t  PMAa  =  MAXIMUM  ORDER  OF  FIlTEH  i  INTEGe.1  ImPuT 

C  J  =  aUE  OP  FFT  tMUST  BE  a  PO’.ER  OF  ^  TO  USE  r  <i.FFT  )>  INTEGER  INPUT 
C  PBEST  =  BEST  ORDER  OF  FILTER!  iNTlGER  CUTPUT 

c  A  ( 1  a  ( PbEST )  =  COMPLEX  PReOICTIVf  FIlTEI<  r  EFFICIENTS!  CUTPUT 

C  PKOO  =  PRODUCT (1— AoS<A(P} ) **2)  FOR  P=i  TO  PutST!  uUTPulT 

C  RhO  . . .  =  CONPLtX  NOhMAL^ED  COnAcXf  TIC..S*  OUTPUT 

C  Xa(1)*...»XX{J)  =  FRACTIONAL  Po^ErS*  ol'TPuT 

C  CO( l»*...*CO(J/‘*+l)  =  &OAKTER  cOSl.UE  TABLE  FoP  fFT  PurPOSES 
0  r  IS  A  REQUIRED  COMPLEX  AUXILIARY  ARRaY 
C  Yf  is  A  HEOU1RE0  AUXILIARY  ARRaY 

Parameter  ns  100*  pmaxsio*  j=  bi2.  jai=j/a*-i 

INTEGER  PbEST 

COMPLEX  XIN) *Y(n) f m(HMAa) »UhO(PmAX) 

DIMENSION  XXtOJ »YY(j1,CO(Jai) 

C  COMPLEX  iNrUT  DATA  IN  X(l) . X<l.) 

COMPLEX  A1,2U400J 

define  lHANOSl*b*«15-M  U-SlGfJ(i*  I*S**lb)  ) /  «• )  •  j<*359738367 
DEF  INE  RANUsFLOAT <  i  i  /34B597383P7. 

I:S28l 

Al=< ,b 5, ,6b) 

2(1)=<0,*0.) 
tu  21  L=2.1400 
IsIRAND 
Hl=HAMD-yb 
1=1RANU 
R2=RANQ— yb 

21  2(L)=A1»2(L-1)mCMPLX<R1,R2) 

DO  22  1:1 • N 

22  XiUsZatlROO-N) 

PRINT  1 

1  format (ihi»»  input  oata:*) 

PRINT  4,  (All)  1  I=1#M 

c  evaluate  predictive  filter  coefficients 

call  8uRGCX(N#PmAX#XaY#PBEST#A#PR0u»RH0) 

PRINT  9,  XIN> 

9  FORMAT (/*  M£%N  :  (*, Eli. 8. *,*,cl3. &,*)*> 

R1=REAL( Y (N) 1 

PRINT  10*  HI 

10  FwRMATC  STANDARD  DEVIATION  =  '»E13.S>) 

PRINT  2,  PBEST 

2  FORMAT ( / *  PBEST  =t.l3> 

IF(PBEST,£tt.0>  GO  TO  12 
PRINT  3 

3  FORMAT!/*  PREDICTIVE  FIlTEK  COEFFICIENTS  FOR  FdEST.**) 

PRINT  4,  ( A ( 1 ) # 1=1 » PBEST ) 

*♦  FORMAT  {•f(E18,8,£15,811 

12  PRINT  S.  PROO 
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5  FORMAT  (/'  PRODUCT  1 1-ABS  l  A4P ) )  **2).  =*#El3.fe) 

PRaNT  6 

b  FQKMAT(/»  NORMAlUcD  CORRELATION  CoEFFICIt  iTS: ' ) 

PRINT  4,  (KNO(I) »I=1*PMAXJ 
CALL  QTRCOS(CO»j) 

c  Evaluate  fractional  powers 

c all  POfcERC (RUEST  #  a  t PROU  *  J » X* » Y T  *  Co  *  SUM ) 

PRINT  7 

j  formati/*  fractional  POWERS;*) 

PRINT  b,  (XX ( II # Isl » J1 
b  FOKMaT(2X»10E13.6) 

PRINT  11 •  SUM 

11  p  jKMAT ( / *  SUM  OF  FRACTIONAL  POMlRS  =*»E13.b) 
t  NU 


■'Oi-ROu  I  1NL  i  URuCX  ( is » PMAx  »  X  *  V  »PU>-  ST  ,  A  »  PPC> »  PH  j  j 
C  Tni‘.»  BU-jlw  TINE  CuaPOTlS.  TmE  COMPLEX  pFEDICTlVE  F  ILTL*  C C’lFFIC  TENTS 
c  . i  —  '_,Ln  OP  COMPlEX  L  aTa  POI’  TSJ  luff GER  INPuT 

l  (>;,;>  -  ’-A a  1 RIUN  OKucK  oF  FIi_TLk;  Ii«T£Gf»-  IuPul 

L  AU),a(*>»...»aIN)  =  v-U'*iPLCX  UaTA  A.RRaY  On  IFPUT*  ALTEKE.;  ON  OUTPUT 
L  O.i  >,uT  PUT  t  X(1 )  »X12)  »  .  .  .  #X(PMAY  )  =■  A  ( i  }  PPAX  )  .  *  (  2  » P  *A  A )  *  .  .  ,  »  A  ( P*  AX  f  P*’ AX  ) 

l  y  a)  ,iu)  »  f,..tiN)  =  complex  auxiliary  array*  so^tcp  input 

l  Gu  CoTPUTf  Y  ( I )  t  Y  )#..»>  Y  ( PFiA*  )  =  A ( j 5 1 ) » A (2 ! 2 ) » . • • i A i P  AX  * pkAX ) 

c  O,.  colPUT,  X(N)  =  -iEAn.  Aflu  Y(p)  =  STAf OAKc  l EVICTION  OF  INPUT  DATA 

C  PaEbT  =  BEST  GRUEk  OF  P  ILTt.R*  INTEGER  OUTPUT 

C  A(I)  ,M(2),.#.,A(Pbe.ST)  =  COMPLEX  rREDiCTWE  FILTER  COEFFICIENT  ARnAY  = 
c  A  ( 1 ;  PuLhT  j  ,  A  (2  #Pt)EST  )*.♦.»«  < PofST  J P3EsT  )  *  OUTPUT 
C  Pkoi  =  PHj'  uCT(l-AbS(M(PJp„EST)  )*'*c>  fCR  P=1  TO  foESTJ  OUTPUT 
s.  RriOl  A)  f  .  .  .  »RH0(HF,AX)  =  COMPLEX  NOrt  'ALliEP  COROLLA  T  ICnS  *  OUTPUT 
l  Co'>LbX  X(,.1.Y(N),A(PmAX),aHG(pMAX)  Ts  PECuIPE'J  IN  r.AlN  pRObRAM 
INTEGER  PM,aa»PolST#P 
wtjOOLE  .KcClSlON  SmK»Sa1#SI; 

CumPlEa  Si*b*T 

ComPLEa  X(1)*Y(1)*m(1)»  r»HO  ( l ) 

IF  (P^AX.GT.3.*bv,(RT  (r.i)  PRINT  2#  PMAX»N 
i  Fv^rv.aTI/'  Pi-AX  =‘.14»»  IS  TrO  LaRGe  FOR  «u*Pit<  OF  CATa  POINTS  u  =• 

S,  lb) 

C  COvPofE  i-'t-AN 

si=( j.*:.) 

C 0  1  1=] • N 
1  ti=Sl>X{l) 

S 1 =CPPLA (KLAL(S1)/U,aImaG(Si )/N) 
w  SuUTKaCT  iitAru  aNu  SCmLE  TC  UNjT  vARIAfCE 
3^=0  . 

i  J  3  1  =  1 »N 

> ii)=xtl)-si 

O  S*=S2+*EAL(X(lJ  )**£  +  AlF»AG(X{I))»*2 

Lt=S0HT (Si/(N-1.) ) 

,:  =  1./S2 
L  O  3  1  =  1 »N 

a  v  a  )  =Ci>.r'LX  ( PEAL  (  X  (  i  }  l*df  AIMaG  (  X  ( 1 1  j  *B ) 

S  til)=X(I> 
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C  B£G1N  RECURSION 

t-=c 

Fr»OOUC=l  . 

MiCtaINz''  t 

rritSTsO 
V  huD  =  l . 
rZr'+l 

CmU.ul.ATE  cKOSS-GAINJ  LU.  155 
Sak=';  .u"1 
SmXsO.O'* 
bd=O.DC 
LiP  +  1 

Lu  7  I=L*N 
11=KEAL(XU)1 
U=Alt*.AC-»X(I)  ) 

TO=REALtT(X-D  ) 
l4=AUMdtY(i-li  ) 

SAK-SArv*Tl*TA*T£*TH 
3Al=S«l+T<l*T.5-Tl*T4 

SB=SB+Tl**2+T2**2+T3l‘*2+T4**2 
h=2»*SAK/Sb 
C-2.*SAI/Sd 
G=CMPLX(B.C) 
fcsl ,-B*B*C*C 

hhoouc=pkoduc*b 

CALCULATE  FILTEB  COEFFICIENTS*  EQS.  l6C«l46.  STOhC  In  X ( 1 X ( P ) 
X(P)=G 

IF (P.Eu, 1)  GO  TO  8 
L=P/2 

CO  9  1  =  1, L 

T=X ( I ) -&*CONJG ( X (P-II ) 

*(P-I)=xiP-n-G*COnUG(X(I)) 

X(I)=T 

CALCULATE  NORMALIZED  CORRELATION  COEFf IC IENT >  E  j.  149 
T-X(P) 

IF(P.EQ.l)  GO  TO  14 
L=P-1 

CO  15  I =1 » L 
15  T=T-*-X(I)*RHOiP-I) 

14  RHO(P)=T 

c  calculate  a&mkE’s  information  criterion*  egg.  isoa202 

R£LERR=B*SNGL  1  SB )  /  1 2  »  *  ( u-p )  ) 

A iCrLOG ( RELEBRl+4 , VFLOAT (P )  /  (N-P  Jt 
I F ( A ICjGE • A I CM I N )  GO  TO  10 
aTCmITC=AIC 
PdEST=P 
PKOOsPROPUC 
CU  11  I=4»P 
11  A(I)=X(Ii 

10  lFtP,EG,PMAXi  GO  TO  16 

C  UPDATE  FORWARD  AND  BACK*ARu  SEQUENCES,  EQ,  u>3 
L=P*l 

DO  12  I=N*L»-1 

T=X(I)-G*TiI-li 

T{I)=T(I»U-CONjG<G1*X(I) 
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12  X(I)=T 
Y(P)=G 
60  TO  6 

lt>  Y  (PMAX)zG 

IF IP8EST jEQ.RMAX)  60  TO  4 

c  compute  extrapolated  normalized  correlation 

C  COEFFICIENTS  FROM  PBESTfl  TO  PmAX*  EQ.  165 
L=PBEST+1 
UO  17  PsL*PMAX 
AlP)=(0,»O.) 

T=(0.,0.1 
UO  la  l=l*PbEST 
18  T=T+A(I)*RHO(P-I) 

17  RHO(P)zT 

4  x (N)zSl 

Y(N)=CmPLX(S2>0.) 

RETURN 

EnU 


SuoROUl  INC.  pO»£f<C(Pbt.ST#  A,P«OD#u»Xx»  YY,CO,SU  - ) 

c  Tnis  subroutine  computes  the  fractional  po.<£fs  ii.  ba,.„s  i/(j»uflta> 
o  Pdtsr  =  best  order  of  filter i  integer  input 

C  A  . . . . A(PBEST)  Z  COMPLEX  FIlTEK  COeFFICIFM  4nRAY{  INPUT 

C  PKOL  =  PROUUCT(1-AbS(a(P} )**2> ,FOR  P=l  TO  PoESTj  INPuT 
C  J  =  SIZE  OP  FFTI  INTEGER  InPUT 

c  xx  =  auxiliary  array  on  input 

C  XXll)*...»XX(U)  =  FRACTIONAL  Powers  0;.  OUTPUT 
C  rr  =  AUXILIARY  ARRAY  I  SCRATCH  INPUT 

c  Com *...»cO(j/h+d  ?  guarteh  cosi.je  table  fo-'  fft»  Input 

C  DIMENSION  XX(J)  .YY  (Jl.C0(J/4+l)  is  REQUIRED  IN  I-  a  If  PROGRAM 
0  COMPutX  A(pMAX)  IS  RlGUIRlO  Iv  MAIN  ppOGKA  '»  •.ifcnE  I-  '.AX.&E.rBFSl 

integer  pbest 

DIMENSION  XXUl  #YY(I>#CO(l) 

COMPLEX  All) 

FzPROD/j 

XXU)=1. 

YY(1)=C. 

IF (PdEST.EO.C)  GO  10  4 

lo  i  i= i» pbest 

XX ( 1+1 )  ztREAL ( A ( II ) 

1  YT ( 1*1 ) z*AIMAG ( A ( I } } 

4  L=PBEST+2 

CO  2  IzL * u 
XX(I)Z0. 

2  YY(I)=0. 

Lzl.4427*LOG( J)+..5  0  LOG2CJ* 

call  mklFFT(XX#yy»co»l#-1) 

SUM=0« 

UO  3  1=1 fU 

XXU)=F/«AX(I)’**2tYY<IJ**2) 

3  SUM=SUN^XX( 1 ) 

RETURN 

ENo 
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8uof<QUT  iNt  eiM.Fr  T  (At  Y  #C0*;w,'»  JSNJ 
i.  iNi&rJSlCN  Xtl)  .t  (1)  ,CC(i)  ,L(12J 

G.'OUl  w  A^tNC£  <L12*LUi  »  *  (LU  ,L<2)  >  *  (tlO,L(o)  )  .  u.9,t_<4>  )  ,  <Lu,l  («H)  )  , 
llt_7*i-(o)i*  <t6*L(  U)  *  U.d,L<A)  ) »  <L4,l/9>  >  *  <t3.(_Un>  >  *  <L.c,L<ll)  )  , 
2(U»U12)) 
l.s2**w 

KU4IU/4 

I  C+PZZtrJ 4P1U 
,.u2P2=Wv4+ND*»P2 

t.0  8  LO=l*<’i 
t>»AS2**  (K-UO) 

LIX=2*L  'X 
I8CL=.VtI* 

«.o  8  U«=1*LSX 
lAKGi(t- *1 ) *iSOu+l 
IF  UARG,LL.riD4Hl. )  oC  TO  4 
Cs*CC  (i\Q2P<-XAkm) 

SiiSl<*CC  ( IAhG-uuH) 

00  TO  o 
4  c=CC(ImRG) 

8=iS,  I*CC  <N0aR2-I AFoJ 
O  i,v  8  LI=LIX.fJ*UX 
ol=LI-LIX+Ll'' 
wC=01+L'-X 

Tlsx(Ji)-*X(u2J 
Tc=r<ju*r  <^,2) 

A<Jl)=X(vil)+Xlj2> 

X  <o2)  =C*Tl«i>f  T2 
T |02)=C*T2+S*T1 
o  tunTlNuE 
LQ  4C  0=1*12 

L<0)=1 

IFIO-*)  il.31*4C 

31  no)  =2**  <l»  +  l-o) 

**C  CO.NTINUf 
wN=l 

to  80  01*1*  Li 
l  o  60  o2?ol  *l2*i.1 

lai  oO  o3*o2  *  L.3  *  t.2 
LO  8C  045J3.t4,uO 
to  60  08?04.Lb*c4 
(,o  60  J6t08*L6*LS 
tO  60  07?Ot>  *  L7  i  i»6 
uO  60  jA507*L8»v.7 
uO  60  o9Sod*U9/t6 
(,0  60  o1Q=J9*L10*L9 
Do  60  Oli=Jl0*LU*tl0 
tO  6C  o«?JU*L12*LU 
lF(0(<-0ril  81*81,82 
Si  KsX(jM) 

X  <ON ) —A ( JKJ 
A l Jrt ) =K 
F 1  =  Y  <JN) 
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Y(JU)=Y(Jk) 
Y{JR)=FI 
52  JN- JN+1 
60  CONTINUE 
RETURN 


EwU 


SooKOUTINE  oTKCoS < C » hJ 
LIhENSION  C(l) 
Ih1:,j/4+1 
ScL=6.2o3idb307/N 
CO  1  I=1fN41 

i  C(I)=C05Ul-l)*bCU 

nfcTUHN 

LiMu 


A- 7/ A- 8 
Reverse  Blank 
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Appendix  B 

PROPERTY  OF  AUTOREGRESSIVE  MODEL 

The  process  of  interest  here  is  given  by  the  autoregressive  model 


(22): 


*k  ’  j/Vk-n  *  Vk-n>  *  V 


where  excitation  {w^}  is  analytic  white  noise.  That  is. 


(B-l) 


w,  wf  ■  6  , 

k  k-m  om 


w,  w,  *  0 . 
k  k-m 


(B-2) 


It  then  follows  easily  that 


Vk-„  *  5 on'  Vk-»  "  °-  m  -  °- 


(B-3) 


Use  of  (B-2)  and  (B-3)  then  yields  correlations 

P 

R_  =  \\  _  -  l  (g  *  _  ♦  h  R*  ) ,  m  >  0, 

m  K  K-m  _  ,  n  m-n  n  m-n 
n*  1 


( B— 4) 


R_  =  x,  x.*  _  =  l  (gnRn 
n=l 


h  R*  )  +  <5  ,  m  >  0.  (B-5) 


For  given  coefficients  (g  and  (hn)P,  (B-4)  and  (B-5)  constitute  si¬ 
multaneous  equations  in  tne  unknown  correlations. 


Now  let  us  suppose  that 

h  *0,  l<n<p 
n  —  —  r 


(B-6) 


Then,  from  (B-4),  the  first  p  ♦  1  equations  yield 


B-l 
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* 

o 


0 


Therefore 


K  -  g.K  g  R  «  0. 

p  B1  p-1  6p  o 


<R  *  0,  0  <  b  <  p, 

7T>  —  - — 


(B-") 


'  B-81 


and  from  (B-4)  and  (B-6)  it  follows  that  is  :ero  for  all  m. 
Conversely,  assume  that 

I?  *  0,  0  <  m  <  p. 

Then,  from  (B-4),  the  first  p  equations  yield 

h, R,  ♦  h_R.  *  ♦  h  R  =0 

11  2  2  p  p 


'B-9' 


h.R,,  ♦  h.R.  ♦ 

1  2-p  2  3-p 


h  R  *  0. 
P  1 


Therefore 


(B- 10) 


h^  =  0,  1  ^  n  <_  p,  (B-l  1) 

and  from  (B-4)  and  (B-l 1 )  it  follows  that  is  :ero  for  all  m. 

Thus  we  have  shown  that  (hn)p  *  0  if  and  only  if  (Rm>P  »  0  in  the 
autoregression  (B-l)  with  analytic  white  noise  excitation. 


B-2 
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Appendix  C 

NONANA LYTIC  WHITE  NOISE  EXCITATION 


Suppose  a  process  is  generated  according  to  autoregression 

P 


T  g  x. 

*•,  n  k-n 
n*l 


V 


(C-l) 


where  excitation  {w^}  satisfies 


w,  wj  »  B6  . 
k  k-m  oral 


w.w,  *  66 
k  k-m  om 


(C-2) 


6  is  complex  and  nonzero;  therefore  {w^}  is  not  an  analytic  process, 
although  it  is  white. 

Then  {•?„,}  need  not  equal  zero,  even  for  autoregression  (C-l).  Fot 
example,  for 

P  *  1.  ■  g.  I g !  <  g  complex,  (C-3) 

we  find  correlations 


R 

o 


R 

m 


rn  ,  _ 

g  R0.  m  i  1»  R 


(C-4) 


R 

o 


g0^  ,  m  >  1 ,  R  *  R  . 
6  o  —  -m  m 


The  corresponding  spectra  are 

G(f)  - 


AB 

- i 2wf A  i  2 


1  -  ge 


(C-S) 


(C-6) 


C(f)  -  - - - y  .  (C-7) 

1  -  2g  cos  (2*fA)  ♦  g 
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Since  the  {IJj,}  are  not  zero,  appendix  B  shows  that  the  coeffi¬ 
cients  (hn)  would  not  be  zero  in  model  (B-l)  with  an  analytic  excita¬ 
tion. 

Optimum  prediction  on  process  (C-l)  using  (15)  gives 

g,  n  *  1 

0,  n  i  1  ’  (C-8) 

with  a  minimum  mean  square  error  equal  to  B,  and  the  spectral  estimate 
is  identically  (C-6) .  Thus,  for  this  example,  the  nonanalyticity  of 
the  excitation  is  no  problem. 

For  the  more  general  model  of  (C-l)  with  p  >  1,  it  can  be  shown 
that  all  { Rm }  are  independent  of  the  value  of  6.  Then,  although  (15) 
may  not  be  too  accurate  for  waveform  prediction,  it  can  still  be  used 
for  spectral  estimation  purposes. 

Optimum  prediction  on  process  (C-l)  using  (19)  gives 

gj  *  g,  other  coefficients  =  0,  (C-9) 

with  a  minimum  mean  square  error  equal  to  B.  This  yields  the  same  re¬ 
sult  as  above. 


C-2 
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Appendix  D 

A  METHOD  OF  GENERATING  ANALYTIC  PROCESSES 


Suppose  linear  filter  H(f)  is  excited  by  complex  input  x(t), 
yielding  output  y(t) .  Then  correlation 


R(t)  =  y(t)y*(t  -  t)  «  /  df  exp  (i2rfT)G  (f)  |H(f) \2  ,  CD-I) 
X  ^ 

and  spectrum 


G  (f)  =  /  dt  exp  (-i2ffft)R  (t)  *  G  ff) |H(f) |  “ .  CD-2) 

X  X  A 

Also  correlation 

«y(T)  5  y(t)y (t  -  t) 

*  /  df  exp  (i2TrfT)«x(f)H(f)  H(-f),  CD-3) 


and  spectrum 

®(f)  =  /  dr  exp  (-i2irft)R  (t)  *  C  (f)  H(f)  H(-f)  .  CD-4) 

X  X  * 

Complex  process  y(t)  is  defined  as  being  analytic  if  CD- 3)  is  zero 
for  all  t.  Suppose  that  filter 

H (f )  ■  0  for  f  <  0.  (D-5) 

Denote  the  output  of  filter  CD- 5)  by  y«.(t).  Then  CD-4)  shows  that 
£y(f)  and  Ry(t)  are  both  identically  zero  for  all  argument  values. 
Therefore  single-sided  waveform  y+(t)  is  an  analytic  process. 

Let  complex  envelope 

£(t)  s  y+(t)  exp  (-i2irfot).  (D-6) 


D-l 
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f?  fr‘  =  v  f t '  v *  '  t  -  t".  =  R  (t)  exp  t  , 

\*  v  —  —  V  +  o 


ro-") 


for  an’.'  f  0 .  Thus  the  complex  envelope  of  any  stationary  process  is  an 
anal'-'tic  process. 


hr.  the  other  hand,  for  the  two  real  processes  ut!  and  v  tl  ,  no 
linear  combination,  vu  t  *  £v  t  ,  where  i  and  :  are  complex,  ever 
yields  an  analytic  process  unless  Ruu  '  ,  "w  *  ■  ar‘^  '  satisfy 
verv  special  restrictions.  Thus  the  process  constructed  in  1  was  not 
anal- tic  and  could  net  have  been  expected  to  yield  good  prediction  cap- 
a b i  1 : t  via  I!  . 


n- 
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Appendix  E 

CAPABILITY  OF  A  MORE  GENERAL  PREDICTION  MODEL 
If  p  is  infinite  in  (19),  we  have  prediction 

+  h  x;  J  • 


x.  =  I  (g  x. 
n=l 


k  _L,  V6n  k-n  “n  k-n' 


(E-l) 


Minimization  of  |qj2  =  |xk  -  xj< | 2  yields  the  simultaneous  equations 


V  (gR  +  h  R*  )  =  R  ,  1  <  m, 
V6n  m-n  n  m-n'  m’  - 

n=l 


n*l 


(gfi  ♦  h  R*  )  *  K  ,  1  <  m. 
V6n  m-n  n  m-n'  m’  - 


(E-2) 


It  can  then  be  shown  that  is  a  white  process  with 


mmlij1  -  Ro  -  I  VJ  • 

n- 1 


(E-3) 


Also  it  can  be  shown  that 


T,  T,  =  0  for  m  i  0, 
k  k-m 


(E-4) 


with 


7,  2  «  R  .  [  (g  £  +  h  R  ) . 

k  o  u.  V6n  n  n  n' 
n*  l 


(E-5) 


However,  is  not  an  analytic  process  since  (E-5)  is  not  zero. 
The  simplest  example  to  demonstrate  this  is 


R  =  R  6  ,  R  =  R  6  , 

n  o  on’  n  o  on’ 


(E-6) 


for  which  (E-3)  and  (E-5)  yield  R0  and  R0,  respectively. 
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The  spectral  relations  for  (E-l)  take  the  forms 
G-(f)  =  G  (f)  |A(f)  |  2  +  G  (>f)  |B(f)  |  2 

X  a  A 


*cm  A(f)  B*  (f)  +  [C  (f)A(f)B*(f)l*  (E-7) 

X  A 

and 

C?(f)  =  «x(f)A(f)A(-f)  *  C*Cf]BCf)BC-f) 


where 


and 


*  G  (f)A(f)BC-f)  ♦  G  (-f)A(-f)BCf), 

X  X 


Aff)  =  l  gn  exp  (-i2-  ~r\L) 
n=l  n 

30 

B(f)  =  l  h  exp  (-i2Tfn.il 
n=l  n 


(E-8) 


(E-9) 


are  considered  known  after  solution  of  (E-2>  for  coefficients  (gn;  and 
{hn:.  Equations  fE-'”'  and  (E-S)  can  be  solved  for  Gx(f)  and  C-x  ( f )  : 


A  A* 

B  B* 

A  B 

B  B* 

A  A* 

A  B 

A_  B 

A  B  _ 

A  A 

A*B* 

A*B* 

B*B 

where 

A 

G 

x 

€ 


A*  B 

V 

A*B 

G 

X 

X 

B  B 

G 

Gv 

X 

X 

A*  A* 

o*  ■ 

fit 

L  ^  j 

x_ 

A  (  f )  , 

A  = 

A  ( -  f )  , 

G  (tl 

.  G 

5  G  C-f) 

X 

X 

X 

C  ;  f  A 

X 

• r; 

5  «*(f'  - 

X 

(E- 10) 
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This  requires  the  inverse  of  a  4  x  4  complex  matrix  at  each  value  of 
frequency  f. 

Thus  an  alternative  spectral  estimation  technique  is  available 
from  the  more  general  prediction  model  in  (E-l).  Whether  it  is  worth¬ 
while  in  terms  of  stability  and  resolution  is  unknown,  as  it  has  not 
been  pursued. 
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Probability  Distribution  of 
Spectral  Estimates  Obtained  Via 
Overlapped  FFT  Processing  of 

Windowed  Data 


Albert  H.  Nuttall 
ABSTRACT 


The  characteristic  function  of  spectral  estimates 
obtained  via  overlapped  FFT  processing  of  windowed 
data  is  presented  for  a  random  process  containing  a 
signal  tone  and  Gaussian  noise.  For  the  special  case 
of  noise-alone,  the  probability  distribution  of  the 
estimate  is  plotted  and  compared  with  an  approxima¬ 
tion  utilizing  only  the  first  two  moments  and  found 
to  be  in  excellent  agreement  in  probability  over 
the  range  (.0001,  .9999)  for  several  data  windows, 
overlaps,  and  time-bandwidth  products.  This  result 
means  that  knowledge  of  the  equivalent  degrees  of 
freedom  of  the  spectral  estimate  is  adequate  for  a 
complete  probabilistic  description,  even  when  the 
overlap  results  in  significant  statistical  depend¬ 
ence  of  the  component  FFT  outputs. 
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PROBABILITY  DISTRIBUTION  OF  SPECTRAL  ESTIMATES 
OBTAINED  VIA  OVERLAPPED  FFT  PROCESSING 
OF  WINDOWED  DATA 

INTRODUCTION 

Estimation  of  the  autospectrum  of  a  stationary  random  process  by 
means  of  overlapped  FFT  processing  of  windowed  data  (the  so-called 
direct  method)  is  a  popular  and  efficient  method,  especially  for  data 
with  pure  tones  present.  Stable  spectral  estimates,  as  measured  by 
the  equivalent  degrees  of  freedom  of  the  spectral  estimate,  result 
when  the  product  of  the  available  record  length  and  the  desired 
frequency  resolution  (the  time-bandwidth  product)  is  large  in  com¬ 
parison  with  unity.  (See,  for  example,  references  1  and  2  and  the 
references  listed  therein.) 

The  equivalent  degrees  of  freedom  of  the  spectral  estimate  is  an 
incomplete  probabilistic  descriptor,  because  it  depends  only  on  the 
mean  and  variance  of  the  random  variable.  In  this  report,  we  address 
the  problem  of  obtaining  the  characteristic  function  of  the  spectral 
estimate  with  overlap  processing,  of  a  signal  tone  present  in  Gaussian 
noise,  and  thence  the  cumulative  probability  distribution  (perhaps  by 
numerical  means  as  given  in  references  3  and  4) .  For  the  case  of 
signal-absent  also,  wo-  will  compare  the  exact  probability  distribution 
with  an  approximate  distribution  that  uses  only  the  first  two  moments 
of  the  spectral  estimate,  to  see  when  the  approximate  distribution  can 
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be  used  as  a  valid  probabilistic  description.  Some  related  work  is 
available  in  reference  5  and  the  papers  cited  therein. 

A  discussion  of  the  relative  stability  of  the  spectral  estimates 
with  signal  tones  present,  and  of  a  cross -spectral  estimate, completes 
the  presentation. 

CHARACTERISTIC  FUNCTF'l  FOR  SIGNAL  PLUS  NOISE 


The  method  and  conditions  of  processing  are  described  fully  in 
reference  1  and,  for  sake  of  brevity,  will  not  be  repeated  here.  The 
power  spectral  estimate  at  analysis  frequency,  f,  is  given  by 
(reference  1,  pp .  2-4) 


G(f) 


r. 


where  P  is  the  total  number  of  weighted  data  segments.  Here’ 


Yp(f) 


exp ( - i 2mft ) 


x(t) 


w 


t 


IP 


where  x(t)  is  the  available  (complex)  data  process,  w(.t  is  the  data 
window  of  length  L,  and  S  is  the  amount  of  shift  each  adjacent  data 
window  undergoes.  The  fractional  overlap  is  therefore  1  -  S  L. 


Integrals  without  limits  are  over  the  range  of  non-cero 


:  :it  egra 
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where  {Y  }  are  complex  constants  and  {Y„„}  are  complex  correlated 
ps  Pn 

Gaussian  zero-mean  random  variables,  and  the  correlation  is  due  to  the 
overlapped  processing. 

In  appendix  A,  the  characteristic  function  of  forms  like  (8)  is 
evaluated;  it  specializes  here  to  the  form 


P  I 

c(0  =  n  (l-iApC/P) 

P"1  I 


exp 


Ai»Piy/p 

\  i-iy/p 


i 


19) 


where  {a  ; 
P 


are  the  eigenvalues  of  P  x  P  matrix 


K  = 


(101 


and 

H=QHK  _im( 

where  Q  is  the  normalized  modal  matrix  of  K  ,  and 


The  evaluation  of  K  m  (10)  is  considered  in  appendix  B. 


1  1 


12  i 


g i sen 


[%-p] 


=  G  (  fl  i  (OR, 


where  Gif'  i>  the  noise  spectral  density  at  analysis  frequency ,  r, 


if  (T) 

w 


w  1 1 )  w  *  ( t  - : )  ; 


14  i 
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and 


[Vp] 


1  r  r  . . .r 

1  2  P-1 

r  • 

-1  • 


1-P 


(15) 


where 


$  (mS) 

r  =  — - 

m  ^(0) 


(16) 


A  Fourier  transformation  of  (9)  would  yield  the  probability 
density  function  of  the  spectral  estimate  (8),  for  a  tone  present. 
This  would  have  to  be  done  numerically,  but  has  not  been  pursued  here. 

MEAN  AND  VARIANCE  FOR  SIGNAL  PLUS  NOISE 
By  means  of  (A-16),  the  mean  and  variance*  of  spectral  estimate, 

A 

G(f),  in  (8)  can  be  expressed  as 


Mean  {c(f)|  =  Kq  - 


(17) 


k-1 


Var 


m  H  K 


(18) 


k=l -P 

in  terms  of  the  quantities  in  (12)  and  (13).  Employing  the  explicit 
relationships  in  (12)  and  (13),  there  follows 

.  2  I  ut  r  e  c  ^  I  2 


, (f )4>w (0)  *  A^W(f-fo) 


Mean  |G(f)|  *  Gn< 

•More  generally,  the  cumulants  are  given  by  (A-7) . 


(19) 
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var  {5(f)}  •  [=„(0»W(°)J  ’  f  (''  -t)  |rk’J 

k=  1-P 
P-1 

+  2A2|w(f-fo)|2Gn(f)<J>w(0)pr^  (l -  i|iVk  exp^ik2-r(f-fo)S 

k=l-P  '  '  " 


where  we  have  employed  (15)  and  (5)  . 

At  this  point,  it  is  convenient  to  define  the  output  signal  power 

of  a  window  filter  with  transfer  function,  W,  centered  at  f  as 

Q  (f)  =  A2 | W(f-f  ) j2  ,  (21) 

S  0 

and  the  output  noise  power  of  the  same  filter  as 


■S‘ 


Vf)  -/dp  Gn(p)|W(p-f)| 


Gn(f) 


du|W(y-f ) 


I  2 


=  GffU  (0) 

n  w 


Then  (19)  and  (20)  take  the  forms 

Mean  {6(f)}  =  Qn(f)  *  Qs(f) 

md  P~*  /  |  k  i  \ 

Var  {6(f)}  -  Q2(f)  (l-  T)lrk!2 

k=  1  -P 


25 ) 


.2Qs(f)Qn(f)i^ 

k=l-P 

From  (24),  we  see  that  the  presence  of  signal  (A  *  0)  always  increases 
the  absolute  level  of  the  variance  of  the  spectral  estimate  over  that 
for  noise-alone.  If  the  noise  is  absent,  the  variance  of  the  estimate 
is  cero.  If  the  signal  is  absent,  the  equivalent  degrees  of  freedom, 
defined  as 


^  r^  exp(ik2m(f-fQ)S)  . 
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2 (Mean) 2 

EDF  =  - 

n  Variance 


t  S  (■■  t) 


k=l-P 


is  identical  to  equation  (8),  reference  1,  as  it  should  be. 
On  the  other  hand,  for  Qs(f)  »  Qn(f), 


_  2 (Mean) 2  ^ 
EDFs  ~  Variance 


Qs(f) 


5"(f>  f  (*■  exp(ik2,,(t-fo)s) 


.  (26) 


k=l-P 

When  a  strong  signal  is  present,  EDFS  is  larger  than  EDFn  by  approxi¬ 
mately  the  ratio  j  Qs(f)/Qn(f)  >;>  1,  since  the  ratio  of  sums  in  (25) 
and  (26)  is  approximately  unity  for  f  ■  ffl  and  reasonable  overlaps 
(see  (27)  below).  That  is,  the  relative  fluctuation  in  the  spectral 
estimate  is  reduced  by  the  addition  of  signal,  even  though  the  absolute 
variance  increases. 

For  Hanning  weighting  and  50%  overlap  (S  =  L/2) ,  we  find  r  *  1, 
r+j  =  1/6,  =  0  for  k  £  2 .  Then  the  two  sums  in  (25)  and  (26)  take  on 

the  values 

1  ♦  (l-£)jy  ,  1  ♦  (l4)j  cos  [2ir(f-f0js]  ,  (27) 

respectively.  The  former  value  is  slightly  larger  than  unity,  whereas 
the  latter  value  varies  between  approximately  2/3  and  4/3,  depending  on 
the  exact  location  of  the  signal  tone  frequency,  f  ,  with  respect  to 
the  analysis  frequency,  f.  For  an  FFT  approach,  at  least  one  bin  has 
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its  frequency  location,  f,  such  that  | f-f  | £  (2L) thus,  at  least 
one  frequency  bin  is  located  such  that  the  latter  value  in  (27)  is 
larger  than  unity. 

Figure  1A  represents  the  power  spectral  estimate,  (1),  plotted  on 
a  linear  scale  proportional  to  watts.  The  "ribbon  width"  in  the  region 
of  noise-alone  is  denoted  by  a.  The  amount  of  fluctuation  of  the 
estimate  at  f  is  denoted  by  b  and  is  larger  than  a.  (The  quantity  b 
is  observable  only  by  rerunning  the  spectral  estimation  procedure  for 
independent  noise  segments.) 


A 


G<») 


10  log  G(f) 


Figure  1.  Spectral  Estimates  for  Signal  Plus  Noise 


If,  instead,  the  power  spectral  estimate  is  plotted  on  a  dB  scale 
(see  figure  IB),  the  noise-alone  ribbon  width,  c,  is  larger  than  the 
fluctuation,  d,  of  the  estimate  at  f  .  The  mathematics  behind  this 
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conclusion  follows.  Let  the  spectral  estimate  at  frequency  f  be 
expressed  as 


6(f)  = 


m+x. 


(28) 


where  m  is  non-random  and  x  has  zero-mean  and  variance  o2.  Then 

<fB  =  10  log  2(f)  =  10  log  m  *  10  log(l+*).  (29) 

m 

Now  suppose  that  a/m<<l,  which  could  be  realized  by  means  of  a  large 

number  of  pieces,  P,  or  a  high  signal  to  noise  ratio;  then 

dB  S  io  log  m  +  -i£_  -  .  (30) 

In  10  m 

The  last  term  in  (30)  is  proportional  to  the  relative  stability  of  the 
spectral  estimate  (28);  in  fact 


(31) 


which  can  be  made  arbitrarily  small.  Thus  a  plot  like  figure  1  is 
easily  achievable  and  should  be  anticipated  for  a  pure  tone  in  Gaussian 
noise. 


PROBABILITY  DISTRIBUTION  FOR  NOISE-ALONE 

A 

For  noise-alone,  the  mean  and  variance  of  spectral  estimate,  G(f), 
are  available  from  (19),  (20),  and  (16)  as 

Mean  {2(f)}  =  Gn(f)<t>w(0) , 

Var  {5(f)}  -  G’(f)  ‘  ^  (l-  |^(kS)|!  ,  (32) 

k=l-P 
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which  agree  with  equations  (5)  and  (6),  reference  1,  respectively. 
More  generally,  the  characteristic  function  follows  from  (9)  as 


C(£)  = 


P 


n 

p=i 


-i 


(33) 


Now  let  us  define  a  normalized  random  variable 


g 

notice  that  the  scale  factor 
overlap.  Thus  the  mean  E { g } 


is  independent  of  P  and  the  amount  of 
=  1,  and  the  characteristic  function  of 


A 


g  is 


C 

g 


re) 


33 


where  {a  ^  ;  are  the  eigenvalues  of  matrix  R  in  (15).  Then  by  a 
partial  fraction  expansion,  the  probability  that  random  variable  g 
remains  below  a  threshold  value, v,  is  found  to  be 


where 


Prob  (g<v) 


v>0 


p^k 


i  3b 


l  3" ) 
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We  have  assumed  all  the  eigenvalues  of  R  to  be  unequal;  this  is  the 
case  if  the  overlap  is  greater  than  0,  which  is  the  case  of  most 
practical  interest.  The  eigenvalues  are  all  non-negative  since  R  is 
a  non-negative  definite  matrix  (see  appendix  B) . 

Equation  (36)  is  an  exact  expression  for  the  cumulative  probability 
distribution  in  terms  of  the  eigenvalues  of  matrix  R.  If  we  consider 
another  random  variable,  t,  with  the  same  mean  and  variance  as  g,  a 
candidate  approximate  characteristic  function  is  (guided  by  form  (35)) 

Ct(£)«  (l-i£/b)'b  ,  (38) 

where,  in  order  to  maintain  the  same  variance,  we  choose 


Equation  (8),  reference  1,  shows  that  b  =  K/2,  i.e.,  half  of  the 
equivalent  degrees  of  freedom.  Then  the  approximate  probability 
density  function  is 

p(t) - - -  bbtb_1  e'bt  ,  t>0,  (40) 

r  (b) 

and  the  approximate  cumulative  probability  distribution  is  (equations 
6,5.2  and  6.5.12,  reference  6): 


_ - _  (bv)Vbv  F  (1  ;l+b;bv) ,  v>0. 

r(b+l)  1 


(41) 


11 


TR  5529 


(A  further  simpler  approximation,  not  pursued  here,  would  be  to  set 
=  integer  part  of  b ,  b-,  =  b^  +  1 ,  and  bracket  the  results  above  by 
two  simpler  sums.) 

We  shall  now  make  quantitative  comparisons  between  exact  result 
(36)  and  approximation  (41)  which  has  the  same  mean  and  variance.  The 
question  is:  is  b  in  (39)  and  (41)  a  sufficient  statistic  to  accurately 
quantitatively  describe  the  exact  cumulative  probability  distribution 
(36),  for  representative  data  windows,  overlap,  number  of  pieces,  and 
time-bandwidth  products,  over  the  range  of  probabilities  of  interest 
to  most  users'?  If  so,  then  attention  can  be  confined  to  the  equivalent 
degrees  of  freedom  and  its  maximization  alone,  as  was  done  in  reference 
1;  this  simplification  would  be  most  worthwhile  and  of  obvious  impor¬ 
tance  . 

The  actual  numerical  computation  of  the  cumulative  probability 
distribution  Prob(gv),  is  considered  in  appendix  (2.  In  figure  2,  the 
exact  cumulative  probability  distribution  for  Hanning  windowing  is 
presented  for  time-bandwidth  product  BT  =  S,  In,  32,  oi ,  where  2  is 
the  available  record  length  and  B  is  the  desired  resolution  bandwidth. 
In  each  plot,  the  overlap  is  varied  from  0  to  approximate  1>  "3.,  and 
the  distribution  plotted  on  a  normal  probability  ordinate  covering  the 
range  (.0001,  .9999).  The  fact  that  the  curves  are  not  straight  lines 
over  this  range  means  that  a  Gaussian  approximation  to  the  power 
spectral  estimate  would  not  suffice.  However,  the  Gaussian  approxima¬ 
tion  would  be  a  fairly  good  one  for  larger  BT  and  P  [see  figure  20,  for 
example ) . 
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The  fact  that  the  curves  in  figure  2  are  virtually  identical  for 
overlaps  greater  than  50%  means  that  there  is  little  point  in  choosing 
overlaps  greater  than  this  amount.  This  confirms  the  choices  of  over¬ 
lap  made  in  reference  1,  where  attention  was  confined  to  the  equivalent 
degrees  of  freedom.  The  ideal  distribution  would  be  a  vertical  line 
at  v  =  1;  the  closeness  of  these  curves  to  the  ideal  is  a  measure  of 
the  spread  of  the  spectral  estimate. 

The  corresponding  results  for  the  approximation  (41)  are  presented 
in  figure  3.  The  curves  are  virtually  identical  to  those  of  figure  2 
over  the  complete  range  of  probabilities  considered,  for  various  values 
of  BT  and  overlap. 

For  a  cubic  window,  the  exact  results  and  the  approximation  are 
given  in  figures  4  and  5,  respectively.  The  conclusions  are  identical 
to  those  made  for  the  Hanning  window. 
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FLUCTUATIONS  OF  CROSS  SPECTRAL  ESTIMATE 

This  topic  is  not  directly  related  to  the  earlier  material  on 
autospectral  estimation;  however,  it  is  an  important  observation  and 
merits  a  comment.  For  two  uncorrelated  processes,  x  and  y,  the  cross 

A 

spectrum  G  (f)  =  0.  However, the  cross  spectral  estimate , CYV(f} , 


■XV 


satisfies  the  equations  (reference  2): 


=  o, 

=  0, 

{|G  (f) 

t  xy 

and 


where  K  is  the  equivalent  decrees  of  freedom.  Now,  if  \>>1,  J  f  is 

xy 

approximately  complex  Gaussian.  Therefore,  if  we  define  the  amplitude 
est imate 


A  =  !  G  if]  ! , 


xv 


it  has  probability  density  function 

x 


P  (  X  J 

Then  the  mean  or  A  is 


exp  - 


x>0 , 


f >  ’ ■ o 


cxx<f><\.v(f  A'j 


which  is  a  rather  slow  decay  with  K.  Then  the  ratio  of  the  mean 
amplitude,  ,  J  3  ,  to  the  square  root  of  the  product  of  the  aut  o-spec  t  r.t 
i  s 
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_ MM _ 

GxxC^GyyCf)]1* 


1.253  . 


(46) 


If,  for  example,  K  =  32,  this  ratio  is  .222  which  is  -6,55  dB;  this  is 
not  very  far  down  relative  to  unity  coherence, though  the  two  processes 
are  uncorrelated. 

Also, 


Var  |a| 


G  (f)G  (f) 
xx  yy 


(47) 


and,  therefore, 


Standard  deviation  JaI  .  , 

Mean  |A^  \  7  / 


0.52, 


(48) 


independent  of  K  (or  P) ,  So  for  a  zero  cross-spectrum  value,  A  = 

iG  (f) I  will  always  have  the  same  amount  of  relative  variation, 
xy  - 

regardless  of  the  number  of  pieces  P  (for  large  P)  ;  thus,  on  a  dB 
scale,  the  "ribbon  width"  of  the  cross-spectral  estimate  is  indepen¬ 
dent  of  P,  when  the  two  processes  are  uncorrelated. 
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DISCUSSION 


An  exact  expression  for  the  characteristic  function  of  the  power 
spectral  estimate  of  a  pure  tone  in  Gaussian  noise  has  been  attained, 
and  then  specialized  to  noise-alone,  In  the  noise-alone  case,  a 
numerical  computation  of  the  cumulative  distribution  function  has  been 
conducted.  Comparison  of  the  latter  with  an  approximation  utilizing 
only  the  mean  and  variance  shows  excellent  agreement  over  a  wide  range 
of  probabilities,  regardless  of  the  exact  window,  overlap,  or  the  time- 
bandwidth  product.  This  means  that  concentration  on  the  equivalent 
degrees  of  freedom,  particularly  on  its  maximization,  is  sufficient 
for  a  probabilistic  description  of  the  auto-spectral  estimate. 

Maximizing  the  equivalent  degrees  of  freedom  results  in  a  narrower 
probability  density  function,  as  witnessed  by  the  increased  steepness 
of  the  cumulative  probability  distributions  presented. 

An  entirely  different  method  of  auto-  and  cross-spectral  estimation 
has  been  presented  in  references  7  and  8,  and  is  mentioned  here  as  a 
viable,  attractive  alternative,  particularly  for  short  data  segments. 
Since  only  a  few  parameters  are  estimated,  the  estimates  are  potentially 
more  stable,  whereas  the  technique  considered  here  (and  in  reference  1) 
assigns  independent  degrees  of  freedom  to  each  and  every  frequency 
cell  of  interest  and,  therefore,  requires  the  estimation  of  many  more 
parameters. 
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Appendix  A 


DERIVATION  OF  CHARACTERISTIC  FUNCTION 


The  first  half  of  appendix  C  of  reference  9  considers  the  Hermitian 


F  =  xhbx. 


(A- 1 ) 


with  mean  and  covariance  of. the  complex  random  variable  matrix  X, 

E  |X}  =  m  ,  Cov{X{  =  e  |(X-nri)(X-  m)Hj  =K  ,  (A-2) 


where  matrix  X  is  P  x  1,  and  matrix  B  is  Hermitian  and  P  x  P.  Defining 


P  x  P  matrix 


A  *  K^BK5*  . 


(A-3) 


with  corresponding  normalized  modal  matrix  Q  and  (diagonal)  eigenvalue 


matrix  X ,  we  can  express  (A-l)  as 


F=V"XV=Z 


(A-4) 


where  matrix  V  is  P  x  1  with  mean  and  covariance 


=QhK =  Covjv}  =1  . 


(A-5) 


Then  a  slight  general ization* of  the  second  half  of  appendix  C 
of  reference  9  (see  also  reference  10)  yields  the  characteristic 
function  of  random  variable  F  in  (A-4)  as 


*We  must  also  have  E|(X-m)(X"m)T|  =  in  add 


ition  to  (A-2) 


A-l 
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c«)  -n 

k=  1 


(A-6) 


where  {*.  ;  and  are  the  elements  of  matrices  X.  and  p.  .  The  cumulants 


of  F  follow  easily  from  (A-6)  as 


c  =  (n-1)  ! 
n 


^  (•  )  ■ 


f  A  - " ; 


k=l 


In  particular,  the  first  two  cumulants  are 

P 

Mean  |f|  =  ^  Xk  (MuJ2) 

p  k=l 

Var  {Fj  =  c:  ^  ^-“k^  ' 

k=l 

For  the  case  of  zero-mean  variables,  i.e.,  m=  O,  ,A-5  yields 
p=  O,  and  the  characteristic  function  becomes 


A-  S 


C(C)  =  al-iA.O 


n 

k  =  l 

The  cumulants  are  then 


r-i'H  for  zero-mean  variables 


■V  -  9  ■ 


i  A- 10) 


=  (n- 1 ) !  for  zero-mean  variables. 

k=l 

1 

(it  is  not  necessary  to  evaluate  K  for  eigenvalue  purposes  alone, 
because  the  eigenvalues  •  >.  •  of  matrix  A  defined  in  ( A  -  5 ,  are  the  same 
as  the  eigenvalues  of  KB  or  BK  ) 


As  a  specific  application  of  the  general  results  above,  we  consider 
B=l  ,  m  =  pn1...mp]T.  K  =  [Kp.q]  ■  (A-ll) 

Then  from  (A-3) ,  we  see  that  A  =  K  .  In  order  to  evaluate  (A-8) ,  we 
notice  that 

k=l  p=l 


V  Xk|yk|= 


k=l 


m 


HK^AK^m  =  mH  m  = 


S 


m. 


P  P  P-1 

A  A  »  \  '  |K  I2  =' 


k=l  P ,q=l  p,q=l  k= 1 -P 


(A-13) 


2  ,  (A- 14) 


z 


k=l 


A 2 1 u  |2  =  fi  H\\*  =  m  HK'^  Q\AQKK'1"2m 

k  k 


=  mH  K  A  Q  X  Q  H  K  -5s  m  =  mHKhKKK~hm  =mHKm,  CA-iS) 

Then  (A-8)  yields 

P 

Mean  |f|  =  c,  =  PK  +/  | I  2  > 

1  o  /  j  k 

P-1  k=1 

Var  Jf}  =  c  ^  ^  ^  (P- 1  k  j )  )  KR  | 2  +  2fflHKm  . 


C A- 16) 


k=l  -P 


The  specialization  to  zero-mean  variables  is  obtained  by  dropping  the 
last  terms  in  (A-16) . 
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Appendix  B 


DERIVATION  OF  COVARIANCE  MATRIX 


We  are  interested  in  deriving  the  two  averages 


ElY  Y  *1  and  EiY  Y  i 
|  pn  qn|  (  pn  qnj 


(B-l) 


because  they  are  needed  for  appendix  A  and  to  see  if  the  conditions 
required  there  are  satisfied.  We  have,  from  (7), 

e)y  Y  *j.  dt  du  exp(-i2Trf(t-u))  E|n(t)n*(u)|  w^t~L- (p-l)sj  . 


“  — 

w*  u-i-L- (q-l)S 

7 

C  “  J 


(B-2) 


Letting  the  noise  correlation  in  (B-2)  be  denoted  by  Rn(t-u),  and  its 


spectrum  by  Gn,  (B-2)  becomes 


{YpnYqn}  =  /  dW  Gn(y) J dt  exp( i2lT  (M-  f) r )  w  (p-l)sj  . 

{/du  exp(i2TT(y-f)u)  w  (u-^-(q-l)sJ  j 


du  Gn(u) |W(f-w) 

This  quantity  is  a  function  only  of  the  difference  of  indices  q  and  p. 

If  spectral  window  j W | 2  is  narrower  than  the  detail  in  noise 
spectrum  Gn  in  the  neighborhood  of  analysis  frequency  f,  (B-3)  simplifies 
to 


exp  i2TT(f-u)  (q-p)S 


(B-3) 


B-l 
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E  {YpnYqn|  =  Gn ( f )y*dw  |  W(  f-u)  i  2  exp£i2ir(f-u)  (q-p)sj 

*  Gn(f)<(,w((^-P)S)* 

where 

^w(T)  =  /  dt  w( t )w* (t-T ) 

is  the  autocorrelation  function  of  data  window  w 
Now  let 

VmS) 

"*w(0)  " 

Then 

E^Y  Y  =  G  (f)<|>  (0)  r 
(  pn  qn)  n  w  q-p 

and  from  (10), 

K  =  G  (f)0  CO)  R  ,  -B-S 

n  w 

where  P  x  P  matrix 


is  Hermitian,  Toeplitz,  and  non-negative  definite.*  For  real  weighting 
w,  R  is  a  real  symmetric  Toeplitz  matrix.  The  matrix  in  (B-S)  is  the 
one  required  in  ( A  - 1 1 )  and  (10). 

*This  property  is  easilv  proven  bv  use  of  definitions  (B-5)  and 
(B-b). 

B-2 


(B-b 


t  B-“) 


!  B-4  ) 


i  B-5 
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The  second  quantity  we  desire  is,  by  use  of  (7), 

E  |YpnYqn|  =JJdt  du  exp(-i27Tf(t*u))  E|n(t)n(u)|  w^t-^JL- (p-l)sj  . 

w  Ju-^L-tq-nsj  •  (B-10) 

Letting  the  noise  correlation  in  (B-10)  be  denoted  by(Rn(t-u),  and  its 
spectrum  (Fourier  transform)  by  (J  ,  (B-10)  becomes 

{' 


EjY  Y 
pn  qn 


|  -/duQ.Oo/dt  exp(i2T(y-f)t)  w|t-ji-(p-l)sJ  . 

J "  du  exp(-i2ir(y+f)t)  w  ^u-jL- (q-l)sj 
=y"dyQ  (y)W(f-y)W(f+y)  exp£-i2Trf (L-2S+pS+qS) -i2iTy (q-p)sj  <  (B-ll) 


If  analysis  frequency  f  is  greater  than  the  bandwidth  B  of  spectral 
window  W,  then  W(f-y)  and  W(f+y)  do  not  overlap  on  the  y-axis,  and 
(B-ll)  yields 

E/y  Y  \  S  0  if  f>B.  (B-12) 

(  pn  qnf 

Thus,  the  property  desired  in  appendix  A  (footnote  to  equation  (A-6)) 
holds  true  if  f>B. 
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Appendix  C 


NUMERICAL  COMPUTATION  OF  CUMULATIVE  PROBABILITY  DISTRIBUTION 


The  numerical  computation  of  the  cumulative  probability  distribution 
Prob(g<v)  is  not  accomplished  here  directly  via  the  sum  in  (36).  The 
reason  is  that,  for  large  P,  (36)  is  an  alternating  sum  of  terms  of 
large  magnitude,  and  accuracy  is  lost  in  the  final  result.  Instead, 
the  methods  in  references  3  and  4  are  utilized  on  characteristic 
function  (35):  for  a  random  variable  limited  to  positive  values,  the 
cumulative  probability  distribution  can  be  expressed  as  (reference  4) 


P(v)  =1-7 


2  (  r  MO  ) 

'iReW  dS  -JX~  exP(-iSv)V  ,  v>0, 


(C-l) 


where  f^(£)  is  the  imaginary  part  of  the  characteristic  function  f(£). 
We  have  f^(£)/£«* E  { g }  =  1  as  £  -*■  0.  We  approximate  (C-l)  according  to 


P(v)  S  1 


Hi" 


’2  d£  exp(-iCv)}, 


(C-2) 


and  then  sample  and  approximate  this  expression  according  to 


(C-3) 


2  (  fi^k  ^  r 

P(n  Av)  S  l  -  -  RejA^  w  -  exp  1-ikAC  n  AvI  >  ,  (C- 

i  i  k=0  k 

where  L  A£  =  £2,  and  <w^>  are  Trapezoidal  weights  of  integration.  We 


choose  sampling  increment 


N  Av  ' 


(C— 4 ) 


where  N  is  chosen  large  enough  that  f^(£)/£  does  not  change  much  in 
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A£.  Then 


L 

P(n  Av)  -l  -  ~  Re  |a£^^  wk  1  exp  £-i2tTkn/NfJ  j 


N-l 


2 

IT 


53  gk  eXP['i2T 


=  1  -  ^  AC  Re{  }  g  exp  -i2rkn/N 


C  C  -  5 ) 


where 


k=0 


_ j 

E 

J=0 


/  x _ _ _ _ 

k*jN  (k+jN)  A£ 


,  ^  L*1  1*1 

0<k<N-l,  — -1  £JS~ 


(C-6) 


Equation  (05)  is  an  N-point  FFT;  therefore,  we  choose  N  as  a  power  of  2 
for  speed  purposes. 

The  only  remaining  question  is  the  choice  of  limit  in  C-2'i. 
From  (.55)  ,  we  know  that 

jf.  (C)J  i  }f(Oj  A. _ ! _ rco 

1  p 

n  n:*x  {*•  rP!i 

p=i 

where  r  =  A ^  ^;F.  Therefore 

P  p 

|f.(c)l  1 - 

11”“  {rPc} 

p=IP 


C-2 


(C-8) 
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where  IP  can  be  1  or  2  or  ...  or  P.  Therefore  the  error,  E,  in  using 
(C-2)  rather  than  (C-l)  is  bounded  according  to 


This  equation  can  be  solved  for 


2 


IT 


nw 


-p+ip-i 

£2 _ 

p*i-ip 


(C-9) 


1 


5,  =  P 


«n  {^icp^mp) 

p=ip 


p+i-ip 


(c- 10) 


with  the  guarantee  that  the  error  will  be  less  than  E  if  we  choose  £2 

according  to  (C-10).  Since  IP  is  not  unique,  we  choose  £2  to  be  the 

minimum  value  over  the  range  of  IP=1,  2,...,  P,  for  then  the  integration 

range  in  (C-2)  can  be  kept  to  a  minimum. 

In  summary,  we: 

specify  Av,  E,  P,  BT 

compute  and  £2 

choose  N  =  1024  (say) 

2tt 

compute  A£  =  N 
compute  L  =  £2/A£ 
let  J  =  (L+l )/N 
compute  (C-6) 

compute  FFT  <j  g^  j  and  printout  (C-5) 

choose  N  =  2048,  go  back  to  step  4,  and  observe  change  in  printout. 
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A  program  for  this  procedure  for  the  Hanning  window  follows.  The 
subroutines  TRIMXD  and  EIGVLD  are  presented  in  reference  11,  and  sub¬ 
routines  DPMCOS  and  DPMFFT  are  given  in  reference  12. 

In  order  to  execute  the  approximation  (41),  the  line  under  state¬ 
ment  number  2  is  changed  to  CALL  PROBA(BT,  P,  Y) .  This  subroutine  for 
the  Cubic  window  is  also  presented  below. 

1 1«  T  _3G  *  p 

ul  -cusir.  .<  'Ol )  .  r  1  5 1 )  #Z  (200)  #  r;.o»M(2*) 

j  A  T  A  YNOP"/ -3. 71 902. -2.29052 #-3.09 023 #-2.6 78  to# -2. 57583 #-2,22635# 
W.  j52  75,-1, 04*85. -1.26 155.-.  84162  #-.6244C',-.a53  35  <  0.  .  ,25335#  .5244 
».  .  j- lo2.1 .2.-.155#  1.64485.2. 0  5  275. 2. 326  35#  2. 57582, 2.e  7  8 16  #3. 0  9023# 
*2. 29052,2.7190a/ 

C=1 ,4405825 1  *  #iA'.\I\6 

CAll  -'CCEso  ( 2  #  y  ) 

CAL-  SLHwL'o  t  2  #  0  .  .r*0Rv(  i)  ,3,  #YUCP;M25)  ) 

Caul  JdjCTou#  H50.  ,  325.  #28  5  0.,  2735.  ) 
go  u  ;gt=2.o 
tjT=2  .  IBT 

C  Acl  Sc.  TS/'o  (  Z  «  29  #  a . ) 

call  L I  ‘.Eau  ( Z  #  0  *  0 . ,  Y.,0K  (1)) 

Crtww  -I.y£5oU*1*0.»y:.CP-  (25)  ) 

CAL._  u:r,6Su(t*l*3.  #y- 09^(25)  ) 

CAul  u  *'it  (  2 #  1  •  3 .  ,  Y‘.C«V  ( 1 )  ) 
call  LIuEoo  (  2  #  1  »  0  .  #  Y\0#tv  ( 1 )  ) 

CAll  SlTSmG (-Z » 20 # i .  ) 

GO  21  -=l#ll 

Call.  linESg  (Z*0#wi*.25#Y'.ORP(l)) 
a.  CAll  L InESo  (  2  #  1  #  J*  . 25 #  r'.CR  - ( 25 )  ) 

GO  a2  --2 >2a 

C  All.  L  I  ’.E  So  (  Z  #  0  •  0  ,  >  Y'.On'-  ( sj)  ) 

aa  CAll  LINESo(Z#  1#  4.  #  Y’.CP'-  (  j)  ) 

CAlL  SlTSZo  i  a.  #  2C  #  a  .  ) 

GO  23  1=1.51 
a3  X(l)=, 06*11-1) 

GO  l  I P  =  1 #4 
P- ( UT/C )  •  IP 
SL= (BT/C-1 , ) / ! P- 1 ) 
p«inr  a,  dT,P#SL 

a  F0R.'**mT (////•  8T  ='E13.8#»  P  =*I4,'  S/L  =’£13.8) 

CAll  PPQt GP ( r)T • P # Y ) 

PRI..T  3,  Y 

3  FORMAT ( /5Ezo  ,  8 ) 

00  4  I=l#5l 
«=MIN(r<I)  ,  ,  999*999  ) 

QsMAXU#  .000001 ) 

*  Y(I)=TtN0HM(G#*l) 

CALL  LINESO ( Z • 5 1 #  x . Y ) 

1  C0MINUE 

CALL  PAGEG(Z#0. l.l) 
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il  continue 

CALL  ExiTu(Z) 

END 

SUBROUTINE  pRCBDP{dT*P,  Af,S) 

PARAMETER  y=100  S  MAXIMUM  number  op  pieces 
PARAMETER  N=2048»N<*1=n/4*1 

DOUBLE  PRECISION  H  < M ♦  M  )  , 0  ( w )  # d  ( v )  .  E  ( “  )  ,  u  (  V  )  » F  { '• )  *  GR  l N  )  ,  G I  ( I . )  *  <-C  ( N4 
SI) »C  »ERROR  ,  qELV  *PI »SL»  TPE  »  XI2  *PR  *  AT  ,  DELXl •  S  *  L , »F ICxI 
INTE6ER  P#P1 
OlMENSION  A'lS(l) 

C=I, 4405825800  8  HANNING 

IF(P.LE.M)  go  TC  1 

JZM 

PRINT  2 ,  P, j 

2  FORMAT(/*  P  Z'14»«  IS  GREATER  than  V  z'\l/) 

DO  3  J=l*5l 

3  ANS(J)s-l. 

RETURN 

1  ERR0R=1.D-U 

DElv=.06D0 

PI=3.l4l592653b8979324J0 

P1=P-1 

SL=(BT/C-1.00)/P1 
DO  4  K=0»Pl 

4  D(K4l)=U(K*SL) 

DO  5  JSX»P 

DO  5  KZl,P 
L-ABS ( J-K ) 

5  R(U,K)=D<L) 

CALL  TRIMXD(P*w*R*O.B) 

CAll  EIGVuU ( P * E * D i d » 4  * F ) 

TPE=2.00/(PI*EHROK> 

XI2S1.0100 
PRsO.DO 
00  6  J=P, 1,-1 
PR=PR>LO&(E( J> ) 

AT=1.UO/(P-J4l.00) 

S=P«EXP(AT,(LCG(TP£*AT)-PR) ) 
fa  XI2=MIN(XI2,S) 

NF=N/2 

7  OElXI=2.DO»PI/ (NF.uELV) 

S=XI2/0ELX1 

JC=(S41,D0)/NF 

N1=nF-1 

DO  d  K=0*N1 

S=0.00 

00  9  j=o*uc 

9  SsS*wFIDXI(  (K4v.*NF)*ndLXI) 

GR(K4l)=S 

a  gI(k+d=o.do 

CALL  DPMCOS (CO,NF ) 

U=1,4427*U0G(NF )4.5 
CALL  OPmFPt (GR,GI ,CC, J,-l) 

S=2.00*OELXI/PI 
DO  10  usl#51 
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10  AUi,<J)  =  l.O0-S*GR(J) 

IF(.,F. ria..*)  HETUR.« 

00  1 1  U  =  1  t  46  /  5 

11  print  I2r  A  <S ( - )  * ANS ( J  + 1 )  * Ar.S (^*2 )  # tt:1  (  u*3 ) 
PRI*T  lit  ANS151) 

1.2  FORMAT  (  /5c2C  »8  ) 

NF=r. 

GO  TO  t 

FUNCTION  u  i  T  )  ‘ 

OOudUE  PRECISION  T»SI 
IF(r,3c.l.wO>  oC  T0  1 
SIs2.0C*Pl«r 

U-2.00/3.u0* ( 1 .CO— T ) « ( l ,D0*.^uC*CG5 (SI )  )  +  <  . 
RETURN 
1  U=0.D0 

RETURN 

function  wfioxicxj 

QOUoUE  PPc.ClSlC’>  a  •  X  TOP  i  Ai_  ,  RE  *  3  I  *  TCVP  »  S~ 

IF(X.GT.O.UO)  GC  TO  1 

ATOMS 1 .0100 

wFluXl=.500 

RETURN 

1  IF  (X.GT.XTwP)  GC  TO  3 
ALsl.DO 

B£  =  -E ( P ) «X/P 
CO  2  JI=1,P1 
QI=£ ( JI ) «X/p 
TEMPrAL*6E.jI 
aE=dE-AL*ctI 

2  AL=TEVP 
SQ=Ai_*AU  +  uE*b£ 

IF(SO*(X«£l?wOR)«*^.GT,iP.OO)  x  T  Cp-"'  I; .  (  X TCP  > 
«F  IOX  I=-6t./  (  SG»X  ) 

RETURN 

J  *FlUXl=0.J0 

RETURN 

ENC 

SUBROUTINt  PRCB  A ( QT  tP , ANS ) 

DOUBLE  PRECISION  GC.O,bV.FllL 

INTEGER  Pt PI 

DIMENSION  AnS(D 

C=l. 82009566  •  CUBIC 

P1=P-1 

SU=(BT/C-1, )/Pl 
B=l. 

DO  1  K=1.P1 

1  8=0+2. *(1. -FLOAT  (K)/P)*U(K*SL)**2 

CAPK=2.*F/e 
PRINT  101»  CAP* 

101  FORMAT ( / '  CAPK  IS  'E15.8) 

b=p/b 

18=8 

F8=B-I8 

CAUL  GAMMA(i,>F8rGrS2rt2) 

GO=LOG(CBlE(G) ) 

00  5  K=lrle 
C=F0*K 
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Acoustic  signal  processing,  4513 
All-pole  filter  model,  5303 
Approximate  distribution,  5529 
Approximate  spectral  analysis,  4767 
Approximations  for  statistics,  5291 
Autoregression,  5303,  5505 
Average  magnitude  error,  4113 
Bad  data  points,  5303 
Bias,  5303 

Bias  of  coherence  estimators,  5291 

Burg  technique,  5303,  5505 

Characteristic  function,  1032 

Coherence  estimation,  5291 

Complex  univariate  process,  5505 

Cross- spectral  estimation,  4169-S 

Cumulative  probability  distribution 
function,  1032,  3012 

Delay  weighting,  4767 

Dolph-Chebyshev  weighting,  4767 

Equivalent  degrees  of  freedom,  5529 

Equivalent-noise  bandwidth,  4513 

Fast  Four  Transform  (FFT),  3012, 

4113,  4169,  4169-S,  4513,  4767 

Forward  and  backward  averaging,  5303, 
5419,  5501 

Generalization  of  Burg's  algorithm, 
5419,  5501 

Half-power  bandwidth,  4169,  4169-S, 
4513 


Subject  Matter  Index 

Linear  prediction,  5303,  5419,  5501 
5505 

Magnitude  coherence,  5291 

Magnitude- squared  coherence,  5291 

Maximum  entropy,  5303,  5501,  5505 

Mean -square  error  of  coherence 
estimators,  5291 

Minimum-bias  windows,  4513 

Multivariate  process,  5419,  5501 

Overlapped  FFT  processing,  4513,  5529 

Overlapped  time  weighting,  4767 

Probability  distribution,  5S29,  1032 

Resolution,  5303 

RMS  bandwidth,  4513 

Short  modified  periodograms ,  4169, 

4169-S 

Spectral  analysis,  5303,  5419,  5501, 

5505 

Spectral  estimation,-  4169,  4169-S,  4513, 
5291,  5529 

Statistics  of  coherence  estimators,  5291 
Statistical  bandwidth,  4169,  4169-S 
Time  delay  measurements,  5291 
Variance  of  coherence  estimators,  5291 
Vernier  spectral  analysis,  4767 
Weighted  error  matrices,  5501 
Windowed  data,  4169,  4169-S,  5529 
Yule-Walker  equations,  5303 
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5  60s60+L06(C) 

DO  3  Ksi'bl 
V=.06*(K-1) 

IF (V,GT ,0.)  60  TO  6 

ANS(K)=0. 

60  TO  3 

6  BVs8$V 

ANS(K)sEXP(B*LC6(6V)-aV4FllL(0BLE.(B4l«)*8V)-60) 

3  CONTINUE 
RETURN 

2  PRINT  H ,  B 

4  FORMAT (/*  PROBLEM  AT  B  s  *£15.8) 

RETURN 

FUNCTION  FllL(AtXU) 

DOUBLE  PRECISION  so, to. AO, XD. A 

SOsl.OO 

TDsi.OO 

AO* A- 1,00 

00  1  KS1.1000 

T0sT0*X0/(A04K) 

SOsSO*TO 

1  IF(ABS(T0),lE.I.0-6*ABS(S0))  60  TO  2 

PRINT  3, 

3  FORMAT (/•  1000  TERMS*/) 

2  FULsL06(S0) 

RETURN 

FUNCTION  U(T)  6  CUBIC 
IF(T.6E.l.)  60  TO  1 
U*102H./13l.*(l.-T>«*7 
IF(T.6E.0.?S)  RETURN 
UXU-B162./191.*(.79-T)«*7 
IF (T ,6E,0.5)  RETURN 
U«U426672 • / 151 . • ( . S-T ) *«7 
IF (T,6£,0,25)  RETURN 
USU-S73MM . / 151 . • ( , 25-T ) **7 
RETURN 
1  U*0. 

RETURN 

ENO 
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